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ABSTRACT 



The relativistic electromagnetic projection operators discovered by Fradkin are 
used to obtain a covariant decomposition of the motion of a relativistic charged 
particle into parallel motion and perpendicular gyration. The Lagrangian Lie 
transform method of Littlejohn is used to achieve a transformation to guiding- 
center coordinates in which the rapid oscillatory motion is removed. The natu- 
ral guiding-center Poisson bracket structure and Hamiltonian are derived. The 
guiding-center equations of motion are presented to one order higher than the 
usual drifts, and the correction to the gyromomentum is given. Correspondence 
with the usual noncovariant results, as given by Northrop, is demonstrated. 
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It is possible to add one or more eikonal wave perturbations to the La- 
grangian action for a single particle before performing the guiding-center trans- 
formation. It is shown that such perturbations can be written in manifestly 
gauge-invariant form in guiding-center coordinates; this observation allows us 
to develop a manifestly gauge-invariant oscillation-center theory to arbitrarily 
high order. In this way, again using Lagrangian Lie transforms, we obtain the 
ponderomotive Hamiltonian. 

By summing the guiding-center Lagrangian action over the full distribution 
of guiding centers and adding the Maxwell action, we obtain the total action 
of a guiding-center plasma. Upon variation of this total action, we find a self- 
consistent set of covariant relativistic kinetic and field equations; from these we 
can identify the guiding-center current density and the guiding-center magneti- 
zation. Upon application of Noether's theorem, the total action yields covariant 
conservation laws for the momentum-energy and the angular momentum of a 
relativistic guiding-center plasma; from these we can identify the guiding-center 
stress-energy tensor and the guiding-center spin angular momentum tensor. 

By summing the Lagrangian action for a guiding/oscillation center over the 
full distribution and adding the Maxwell action, variation yields self-consistent 
relativistic kinetic and field equations for the plasma in the wave field, including 
the dispersion relation for the wave; from these we can identify the wave magne- 
tization and susceptibility, thereby demonstrating the K-x theorem. Noether's 
theorem then yields conservation laws for the guiding-center plasma in the pres- 
ence of a wave field, including the wave contribution to the stress-energy and 
spin angular momentum tensors. 
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Chapter 1 

Introduction 



There was a time when a thorough working knowledge of geometry was consid- 
ered an indispensible ingredient in the education of a natural philosopher. From 
Euclid's first systematization of the subject more than two thousand years ago to 
well after the end of the Renaissance, the study of the Elements was considered 
a critically important part of mathematical instruction. Indeed, when reading 
Newton's Principia or Opticks, one is struck by the prevalence of geometrical 
arguments and descriptions. 

Alas, the introduction of coordinate systems by Descartes and the concommi- 
tant analyticization of geometry changed all this. Using coordinates, geometrical 
problems could be reduced to algebraic problems. The perceived need for good 
geometrical intuition gradually disappeared. By the time Whittaker's Treatise 
on the Analytical Dynamics of Particles and Rigid Bodies was first published in 
1904, this attitude had taken hold to the extent that Whittaker apparently felt 
no need to include illustrations in his nearly five-hundred-page-long (and other- 
wise excellent) document. At present, one can obtain an undergraduate degree 
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in physics or even applied mathematics with httle more geometry background 
than is found in a secondary school textbook. 

That this trend is disastrous has been appreciated only for the past couple 
of decades. This appreciation has been due, in large part, to modern develop- 
ments in the general theory of relativity. The entire lesson of relativity theory 
is that physical laws ought not to depend upon the coordinate system chosen to 
describe them; that is, the meaning of physical laws transcends their coordinate 
description. Conversely, coordinate descriptions can have a way of masking fun- 
damental physical reality. Thus, a coordinate-free description of physical laws 
can have the beneficial effect of allowing one more easily to glimpse the under- 
lying fundamental physical reality. Such coordinate-free mathematical language 
is available, thanks in large part to the works of Cartan and Lie. Modern differ- 
ential geometry, including the exterior calculus and the theory of Lie groups, is 
capable of providing a coordinate-free description of physical law. Please note 
that what is being argued here is that such a coordinate-free description is far 
more than just an alternative mathematical notation; the contention is that it 
yields an improved understanding of the physics involved. A physicist who takes 
the time to learn how, say, electromagnetic theory can be described in terms 
of differential forms will have, as a result, an improved understanding of the 
electromagnetic field. 

There is an additional benefit to the geometrical point of view. Just as 
Descartes found that algebra can be used as a tool for obtaining geometrical 
results, likewise geometry can be used as a tool for obtaining analytical results 
that would be far more difficult to obtain any other way. Several examples of 
this phenomenon will be pointed out in the course of this thesis. 

Since the 1960's it has been known that classical mechanics is describable 
in terms of symplectic geometry. This observation paved the way for powerful 
generalizations of some of the traditional methodologies of mechanics. For ex- 
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ample, whereas Hamiltonian mechanics had been originaUy formulated in terms 
of canonically conjugate pairs of coordinates, it was found that noncanonical 
coordinates could be used instead, oftentimes to great advantage. Powerful new 
types of perturbation theory, based on Lie transforms, were introduced; this 
made higher-order perturbative treatments less laborious and more systematic. 

Nowhere was the impact of this revolution more profound and beneficial than 
in the field of plasma physics. Because the motion of charged particles in com- 
plicated electromagnetic geometries and in wave fields requires a perturbative 
treatment, it is not surprising that Lie transform perturbation theory was shown 
to be a natural tool for systematizing, simplifying and better understanding many 
of the calculations of plasma physics. Furthermore, it was shown that the most 
natural treatment of the guiding-center problem (i.e. the ubiquitous problem of 
computing the drifts of a charged particle gyrating in a slowly-varying electro- 
magnetic field) involved the use of noncanonical coordinates and noncanonical 
coordinate transformations. All of this will become more clear as we proceed. 

During the late nineteen seventies, Dewar introduced the idea of canonical 
oscillation-center transformations. Johnston and Kaufman j2] and Johnston jS] 
used canonical perturbation theory to perform oscillation-center and mode cou- 
pling analyses for the Vlasov plasma. In Gary's PhD thesis |5, Lie transforms 
were shown to be a useful tool for ponderomotive theory, and the K-x theo- 
rem jS] relating the ponderomotive Hamiltonian with the linear susceptibility 
was formulated. 

The extension of these techniques to magnetized plasma was made possi- 
ble, or at least greatly facilitated, by Littlejohn's work on the guiding-center 
problem in his PhD thesis jB]. Littlejohn made the key observation that the 
transformation from single-particle to guiding-center coordinates was best done 
using noncanonical methods. This noncanonical transformation was done in his 
thesis by using the Darboux theorem constructively, and it was followed by a 
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canonical Lie transformation that averaged over the rapid gyromotion. Subse- 
quently, Littlejohn ^ discovered that the entire transformation could be done 
by a single Lie transform with a vector generator. This is the approach followed 
in this thesis. 

Ponderomotive theory for a magnetized relativistic plasma was then done by 
Grebogi and Littlejohn jS|, who used canonical Lie transforms. They pointed 
out that the oscillation-center transformation for a magnetized plasma might 
best be handled by noncanonical Lie methods, but they did not do it this way. 
Their result was subsequently simplified by Gary and Newberger jHj. 

Meanwhile, Dubin, Krommes, Oberman and Lee ^U] showed how to use Lit- 
tlejohn's methods to derive self-consistent gyrokinetic equations for an electro- 
static plasma, including the Poisson equation whose source term was written in 
terms of the guiding-center distribution function. Kaufman and Boghosian fH] 
showed that this calculation could be done by summing the guiding-center ac- 
tion over the entire distribution and coupling it to the Maxwell action; variation 
with respect to the coordinate fields (considered to be functions of their initial 
conditions) then yields the gyrokinetic equation, and variation with respect to 
the vector potential then yields the self-consistent field equaton. Finally, Sim- 
ilon ^2] showed that conservation laws for the guiding-center plasma could be 
obtained by application of Noether's theorem to this system action. 

The above-mentioned work by Grebogi and Littlejohn was done for a rela- 
tivistic plasma, but was not manifestly covariant in that it was done in "1 + 3" 
notation. A manifestly covariant treatment is made possible with the help of 
certain projection operators which were introduced by Fradkin ^H] who obtained 
the drifts for a relativistic guiding center (but did not use Lie methods) , and by 
Dumais |14j . 

The general plan of this thesis is as follows: 

Ghapter 121 will cover the mathematical preliminaries necessary to understand 
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the differential geometric arguments used in this thesis. It should be emphasized 
that this constitutes no more than a sketchy introduction, and is no substitute 
for a good text on the subject; nevertheless it is probably sufficient to enable 
a persistent person with an undergraduate background in physics to read and 
understand this entire text. Chapter |21 also describes the application of these 
techniques to Hamiltonian and Lagrangian mechanics; specifically, Lie transform 
perturbation theory is introduced here and many simple examples of its use are 
presented. 

Chapter El will treat the guiding-center problem for a relativistic charged 
particle. We shall begin by examining the geometry of the electromagnetic field 
in four-dimensional spacetime, and we shall find that there is a covariant way 
to isolate the rapidly-gyrating component of the particle's four velocity. Lie 
transform perturbation theory is then applied to the particle's phase-space La- 
grangian in order to remove this rapidly-gyrating component and thus obtain 
the residual parallel and drift motion. The perturbative calculation is carried 
out to one order higher than the usual drifts, the natural guiding- center Poisson 
bracket structure and Hamiltonian are presented, and the correction to the gyro- 
momentum is given. Finally, it is shown how to cast these results in a manifestly 
gyrogauge invariant format. 

In Chapter 0] we shall study the effects of eikonal wave perturbations on a 
guiding center, once again using Lie transform perturbation theory. The result 
is a complete ponderomotive description of the relativistic guiding center in an 
eikonal wave field, and we show how to cast this in manifestly gauge-invariant 
form. To achieve manifest gauge-invariance, we shall find it necessary to abandon 
the usual approach of expanding the eikonal wave perturbation in a series of 
Bessel functions of k±p. Instead, we shall first perform a Lagrangian gauge 
transformation, and then we shall expand in a series of special functions that 
are related to indefinite integrals of Bessel functions. The required Lagrangian 
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gauge transformation is not obvious, and it would never have been discovered 
without the use of differential geometric techniques. Finally, the ponderomotive 
Hamiltonian is derived using Lie transforms. 

In Chapter [H] we shall sum the resulting guiding-center Lagrangian over the 
entire distribution of particles present in a plasma, and couple with the Maxwell 
field to obtain the total Lagrangian for a Vlasov plasma of relativistic guid- 
ing centers. By varying this it is possible to derive a self-consistent gyroki- 
netic description of such a plasma, including the magnetic moment tensor, in 
manifestly-covariant format. Application of Noether's theorem then yields con- 
servation laws for the guiding-center plasma, and these are also cast in manifestly 
covariant form. Finally, using the results of Chapter |3[ the conservation laws 
are derived for a guiding-center plasma in the presence of a wave field. 

In Chapter IHl we discuss some of the unanswered questions raised by this 
study. These could be topics for future research. 

Appendix 1X1 is a glossary of the mathematical symbols and notation used in 
this thesis. 

Appendix IbI is a review of some of the more primitive mathematical concepts 
used in this thesis, such as vector spaces, dual spaces, algebras, and modules. 

Appendix O applies vector Lie transforms to the nonrelativistic guiding- 
center problem in two dimensions, and derives the shift in gyro frequency due 
to spatial gradients in the magnetic and (perpendicular) electric fields. This is 
useful both as a demonstration of the vector Lie transform technique, and as a 
comparison to the techniques and results of Chapter |H1 

Appendix iDl derives and discusses the properties of a pair of special functions 
that were introduced in Chapter HI 

Appendix |E1 is a short tutorial on how to derive Bessel function sum rules, 
including (but not limited to) those that were useful in Chapter HI 



Chapter 2 



Mathematical Preliminaries 



2.1 Discussion 

This chapter divides naturaUy into three sections. The first covers the basic 
results of differential geometry that are necessary to understand the rest of this 
thesis. This includes the calculus of tensors and the exterior algebra. To reit- 
erate, the exposition here is not intended to replace a good introductory book 
on the subject (see, for example, the excellent introductory texts by Schutz [TH], 
Edelen jTHj, Singer and Thorpe jTT], or Burke [HI), but it does present enough 
material to make the thesis self-contained, and to establish notational conven- 
tions. The theory of Lie groups has been omitted from this section because it 
is not absolutely essential to the understanding of what follows, but the reader 
with background in this area will be at a definite advantage. 

Next, these tools are used to reformulate Hamiltonian and Lagrangian me- 
chanics. The generalization to noncanonical coordinates is discussed, including 
those with singular Poisson structures. Noether's theorem is formulated, and 
numerous worked examples are given. Mechanical systems with constraints are 
examined from this new point of view. 
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Mathematical Preliminaries 



Finally, Lie transform perturbation theory is presented, and its use for non- 
canonical coordinates is discussed. Because we shall use Lie transforms in a more 
general context than that in which they are usually presented, I recommend that 
this section be read even by those already familiar with the subject. 

2.2 Differential Geometric Concepts 
2.2.1 Manifolds, Vectors, and Covectors 

In this subsection, we shall discuss the ideas that are necessary to reformulate 
tensor calculus in a fashion that more directly illustrates the geometrical founda- 
tions of the subject. Appendix |B] goes one level deeper, and gives set-theoretical 
definitions for many of the primitive terms that we shall use here (such as vector 
s^^ace and algebra). 

A manifold is a space that is locally Euclidean and in which there is a notion 
of differentiation. This can be made more precise as follows: There must be a 
differentiable one-to-one map, or diffeomorphism, from the neighborhood of any 
point of a manifold to the points of 3?", for some n. Such a map is called a chart, 
and the collection of all such maps for a given manifold is called an atlas. There 
is an additional requirement that two maps in the same atlas that overlap must 
do so smoothly; this means, among other things, that all charts in the same 
atlas must map to 3?" with the same n. The number n is thus characteristic of 
the entire manifold, and is called the dimension of the manifold. 

A chart is realized by (local) coordinates on the manifold. Since an n- 
dimensional manifold, M, must map smoothly onto 9?", it must be possible 
to label the points of M, at least locally, by n numbers, say z^, . . . ,z'^. Then 
the map is given by expressing these numbers as functions of the coordinates, 
x^, . . . , a;"", on 3?". Specifically, we write z°'{x'^, . . . , a;""), for a = 1, . . . , n. 

It is generally not possible to cover an entire manifold with one chart. For 
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example the surface of a sphere is a manifold called S'^, and, as is well known, 
coordinate charts on S"^ must break down somewhere. The chart 

40 

X — — cos (p (2.1) 
on 

4:9 

y^—smcP, (2.2) 

where 9 and (p are the usual spherical coordinates (colatitude and azimuthal 
angles, respectively), maps the region < 9 < 3n/4 onto the open unit disk in 
The chart 

4(n-9) , , 

x = -— ^cos(/) (2.3) 

OTT 

,= l(!LZ^),i„^ (2.4) 

then maps the region t\ j 4 < 9 < t\ onto the open unit disk in 3?^. These two 
charts are thus sufficient to cover all of 5"^, and therefore constitute an atlas. 
Any atlas for S'^ must contain at least two charts. In general, the number of 
charts needed to cover a manifold depends on its global topological properties. 

A mapping from an m-dimensional manifold onto an n-dimensional manifold 
is called an injection if m < n, a projection if m > n, and a bijection ii m = n. 
Consider a map from 3? to an n-dimensional manifold, M. That is, 3? i— > M. Note 
that this is an injection if n > 1, and a bijection if n = 1. This map defines a 
path through the manifold, M. The points in M that are on the path are those 
in the range of the map. The realization of this mapping is given by expressing 
each of the coordinates on M as functions of the coordinate, x, on 3fJ. That is, 
we write z'^{x) for a = 1, . . . , n. As a; varies along 3?, the coordinates z"" trace 
out the path in M. Note that although we keep writing down the coordinate 
realizations of these things, the notion of a map from one manifold to another 
has an intrinsic geometrical meaning as an association of members of one set of 
points with members of another set of points, consistent with local topological 
properties of nearness, etc. 
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Now let P be a point on the above-mentioned path through the manifold, 
M. Denote its coordinates hj Zp, . . . , Zp. Since it lies along the path, there must 
exist a coordinate, xq, of a point in 3?, such that Zp = z°'{xq) for a = 1, . . . , n. 
Now consider the derivatives of the functions, z°'{x), with respect to the path 
parameter, x. Denote these by dz"" /dx. Evaluate these at the point P. This gives 
the n numbers, 

- (2.5) 

associated with the point, P. 

It is clear that there are many different curves passing through point P that 
will yield the same set of n numbers. Indeed, any curve whose coordinates near 
P are given by 

2« = 2« + V'^Sx + 0{5x^) (2.6) 

where 5x = x — xq, will do so. The identification of these n numbers thus gives 
us a way to partition the set of all curves passing through point P into (an 
infinity of) equivalence classes; two curves are said to be equivalent if they yield 
the same set of n numbers. That is, two curves are equivalent if they both have 
the form given in the above equation (with the same V"^s). 

Consider the set of equivalence classes of curves thus obtained. We can 
define addition and scalar multiplication among the elements of this set in the 
following very natural way: The equivalence class of curves with the n numbers 
adds to the equivalence class of curves with the n numbers to yield 
the equivalence class of curves with the n numbers + U". The scalar o 
multiplies the equivalence class of curves with the n numbers V" to yield the 
equivalence class of curves with the n numbers aV". With these operations, we 
have converted the space of all equivalence classes of curves through the point P 
into a vector space. This vector space will be called the tangent space at point 
P of the manifold. Its elements have been introduced as equivalence classes of 



Mathematical Preliminaries 



11 



curves, but it will become clear momentarily that these may be identified with 
the usual notion of vectors as arrows with a certain magnitude and direction 
and with certain transformation properties. Note, however, that the base of the 
arrow is not free to move around, but rather is "pinned down" at the point P. 
There is a different tangent space at each point of a manifold, and vectors in one 
tangent space may not be added to vectors in another different tangent space. 
Note that the dimension of a tangent space is equal to the dimension of the 
manifold (in the above case, the dimension is n). 

It is evident that the above-described n numbers associated with an 
equivalence class of curves depend on our choice of coordinates for M. If our 
coordinates on M had been 2'", then the n numbers would have been 



where we have adopted the convention of summation over repeated indices. 
Readers familiar with traditional presentations of tensor calculus will recognize 
this as the transformation law for components of contravariant vectors. 

Recall that even though the components of a vector may vary from one 
coordinate system to another, the vector itself, as an abstract mathematical 
object, is an invariant geometrical concept. That is, given two sets of basis 
vectors, Gq, and e^, we can write the components of a vector V as in the first 
system and as V" in the second. Though these will, in general, be different, the 
abstract vector V = V^ea = V^e'^ retains its form under the change of basis. 

So how can we introduce bases in our tangent spaces that will refiect this 
idea? Despite the fact that the above-described n numbers are coordinate- 
dependent, if we form a first-order linear differential operator by using them as 
coefficients 

we see that this operator retains its form under a coordinate transformation. 
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This much is clear from the above equation. By analogy with the argument in 
the preceeding paragraph, we can thus identify the operator V with the vector 
V, and the n operators d/dz°' with basis vectors that span the tangent space. 
Thus the idea of vectors as arrows, as equivalence classes of curves, and as first 
order linear differential operators are all valid descriptions of the same concept! 

A word is in order concerning the basis vectors that we have used above. 
Note that they were induced by the coordinate system that we used. The choice 
of a coordinate system z"^ on the manifold M gives rise to a natural basis d / dz°' 
in each tangent space at each point of the manifold (or, more precisely, at each 
point of M where the chart z" is operative). A change in coordinate system thus 
gives rise to a change of basis; this is in accordance with the usual transformation 
properties of contravariant vectors. A basis that is thus induced by a coordinate 
system is called a coordinate basis. In the "arrow" picture, the basis vectors lie 
along the local coordinate axes. In the "equivalence class of curves" picture, they 
are curves that are locally coincident with the coordinate axes. In the "operator" 
picture, they are directional derivatives along the coordinate directions. 

One might well ask if all possible bases are coordinate bases. The answer is 
"no." If we start from a coordinate basis and make a change of basis by taking 
various linearly independent combinations of basis vectors in each tangent space, 
where the combinations may vary from point to point in the manifold, we may 
arrive at a new basis that is not the coordinate basis for any coordinate system 
on M. Thus, starting from the coordinate basis, d/dz°^, we may define the new 
basis 

e/3 = A,"^, (2.9) 

where (A^°) is any nonsingular matrix. This new basis is perfectly good for 
resolving vectors into coordinates. For example, the vector V may be written 

V = y"-^ = (y"A^Je^ (2.10) 
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where the matrix (A^^) is the inverse of the matrix (A^°). So the components 
of V in the new basis are V"A^^. The only different thing about this new basis 
is that there may not be any system of coordinates such that Gq, = d/dZ"". 
In this case, such a basis is caUed a noncoordinate basis. This idea wiU become 
more clear and examples will be given in Subsection 12.2.51 

Meanwhile, since we have now attached vector spaces to every point of a 
manifold, we can go on to construct their dual spaces. The dual space to the 
tangent space of vectors at point P is called the cotangent space at point P. Its 
elements are called covectors or covariant vectors or one forms. Once again, the 
cotangent space has the same dimension as the manifold. 

Once we have a set of basis vectors in the tangent space, say Gq,, there is 
induced a preferred set of basis covectors in the cotangent space, call them 
cD", such that (tD", e^) = dp. Thus we can represent a covector at point P by n 
numbers, say Oq, where, as usual, a can range from 1 to n. The abstract covector 
is then a = aQ,cD°. The covector a pairs with the vector V to yield 

(a, V) = (a«cD«, V^ep) = a«y^(cD«, e^) = a^V^'S'^ = a^V". (2.11) 

Note that even though there is a naturally induced covector basis corre- 
sponding to a given vector basis, there is no natural correspondence between 
individual vectors and individual covectors. That is, there is no natural map 
from the tangent space to the cotangent space. Later on, we shall see that if we 
endow our manifold with a metric, such a map is established. The addition of 
a metric thus gives the manifold much more structure than it would otherwise 
have. At this point in our discussion, we are not assuming the existence of a 
metric on our manifold. As we shall see, even without a metric, a manifold has 
lots of interesting structure to study. The general philosophy of this discussion is 
to start simply and slowly add structure; thus a discussion of metrics is deferred 
to the end of this section. 
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To make our discussion of covectors more concrete, let us suppose that we 
have a coordinate system z"' on our manifold, M. This induces the coordinate 
basis vectors djdz"^ on each tangent space of M. If we transform coordinates 
to another system Z", the components of the vector V transform according to 
Eq. fj2.7|l . Now say the covector a has components in the first coordinate 
system. The components of the covector must transform in such a way as to 
leave the scalar (a, V) invariant. Thus 

a;3y^ = a'«r"=a'„|^y^ (2.12) 

so 

dz^ 

«'a = (2-13) 

Once again, readers familiar with traditional presentations of tensor calculus will 
recognize this as the transformation law for components of covariant vectors. 

Now, how can we introduce bases in our cotangent spaces that will reflect the 
above ideas? Despite the fact that the n numbers coordinate-dependent, 
if we form the differential that has them as coefficients 

a = a^dz"^ = a'adz'", (2.14) 

we see that this retains its form under a coordinate transformation. This much 
is clear from the above equation. We can thus identify the differential form a 
with the covector a, and the n differentials dz"" with basis covectors that span 
the cotangent space. 

Thus, just as contravariant vectors could be identified with first order linear 
differential operators, we see that covectors can be identified with differential 
forms. These descriptions are dual to each other, so 

{dz-^^) = 5-,. (2.15) 

Finally we note that the same distinction between coordinate and noncoor- 
dinate bases that applied to our discussion of tangent space bases also applies 
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to cotangent space bases. Up until now, we have restricted our attention to 
coordinate cotangent bases, but we could define new basis one forms by taking 
linear combinations of the dz°^ where the combinations may vary from point to 
point in the manifold. In this way, we may arrive at a new basis that is not the 
coordinate cotangent basis for any coordinate system on M. Thus, starting from 
the coordinate cotangent space basis, dz", we may define the new cotangent 
space basis 

cD^ = A^^dz" (2.16) 

where (A''^) is any nonsingular matrix. This new basis is perfectly good for 
resolving covectors into coordinates. For example, the covector a may be written 

a = a^rfz" = (a„A^")(Z'^ (2.17) 

where the matrix (A^") is the inverse of the matrix (A^^). So the components 
of a in the new basis are a^A^". The only different thing about this new basis 
is that there may not be any system of coordinates such that oj°' = dZ"^. 
Once again, this idea will become more clear and examples will be given in 
Subsection ITTHl 

2.2.2 General Tensors and the Tensor Product 

Now that we have a tangent space and a cotangent space associated with each 
and every point of our manifold, we can create still bigger spaces at each point 
by taking the Cartesian product of some number of tangent spaces and some 
number of cotangent spaces. Suppose we define the space to be the Carte- 
sian product of s copies of the tangent space and r copies of the cotangent 
space at point P of a manifold M. Consider a multilinear map i— 3?. That 
is, we are considering a map that takes s vectors and r covectors at point P 
and returns a real number. If the s vectors are denoted Vi, . . . , Vg, and the 
r covectors are denoted a^ , . . . , a*" , then the real number will be denoted by 
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T(a^, . . . , a'', Vi, . . . , Vs). By a "multilinear" map, we mean that T is linear in 
all of its arguments. Such a map is said to be a tensor of type (r, s) . Note that a 
vector is a tensor of type (1, 0), and a covector is a tensor of type (0, 1); this is 
because a vector can take a covector and return a real number (by the pairing), 
and vice versa. 

There is an obvious way to define addition among tensors: Given two tensors, 
Ti and T2, we can define a new tensor, T3, by the prescription 



T3(a^...,a^Vl,...,V,) 

= Ti(a\ . . . , a^ Vi, . . . , V,) + T2(a\ . . . , a^ Vi, . . . , V,), (2.18) 



for all possible arguments. In this case, we write T3 = Ti + T2. This operation 
of addition makes the space of all tensors of type (r, s) a vector space. 

Suppose we have two vectors, Ui and U2, and a covector, b^, at some point 
of a manifold. Suppose we are given anew a pair of covectors, a^ and a^, and a 
vector, Vi (at the same point of the manifold). Consider the following recipe for 
obtaining a real number: Pair the two covectors with Ui and U2, respectively, 
and pair the vector with b^. This gives us three real numbers. Multiply them 
together to get a single real number. In this way, the presence of Ui, U2, and b^ 
provides us with a map from 112 to 3?. It is easily seen that this map is multilinear. 
Thus, the presence of Ui,U2, and b^ provides us with the following tensor of 



A tensor formed in this way is said to be the tensor product of Ui, U2, and b^. 
This is denoted 



type (2,1): 



T(a\a2,Vi) = (a\Ui)(a2,U2)(b\Vi). 



(2.19) 



T = Ui ® U2 ® b^ 



(2.20) 



More generally, given r vectors, Ui, . . . , U^, and s covectors, b^, . . . , b*, we 
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can form a tensor of type (r, s) by taking the tensor product 

T = Ui ® . . . ® ® O . . . ® b^ (2.21) 

If we feed this tensor the r covectors, ai, . . . , a^, and the s vectors, Vi, . . . , V^., 
then we get the scalar 

T(ai,...,a„Vi,...,V,) = (ai, Ui) . . . (a„ U,) (bi, Vi) . . . (b„ V,). (2.22) 

The space of aU possible tensors (of any type) at some point in an n- 
dimensional manifold may be thought of as an infinite dimensional vector space, 
although it is somewhat strange in that two of its elements can be added if and 
only if they are tensors of the same type. In any event, the tensor product makes 
this space an algebra. 

It is straightforward to see that the vector space of all tensors of type (r, s) 
is n'^''~'*-dimensional. That is, a tensor of type (r, s) has n'^"'"^ independent com- 
ponents. A moment's thought convinces one that a basis for this space is given 
by the n^~^^ basis tensors 

®...®ea, ^w^^ ®...®cD^% (2.23) 

where the e's and (D's are the basis vectors and basis covectors in the tangent 
and cotangent spaces, respectively, and where the a and (3 indices all range from 
1 to n. Thus, a general tensor may be written 

T = T^^rJ^e^^ ® . . . ® ® cD^i ® . . . ® (2.24) 

Finally, we consider the transformation properties of the components of these 
general tensors. We know how vector and covector components transform, and 
we know that a tensor of type (r, s) takes r covectors and s vectors and returns a 
scalar invariant. Thus, by an argument identical to that which led to Eq. ()2.13p . 
we find that for a transformation from one coordinate basis to another coordinate 
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basis 

„,ai...a, ^ OZ OZ UZ rp^jL^...^^ (O 'yK\ 

The usual distinction between coordinate and noncoordinate bases applies here 
as well, so that for a transformation between general bases the above equation 
generalizes to 

T'^i-"" = A"i„ ■ ■ ■ A"'',^ A. ■ ■ ■ A. "^Tj^^-Jf". (2.26) 

Pl---Ps Ml Mr Pi Ps h>l...l>s \ ' 

2.2.3 The Lie Bracket 

Given a vector field, V, the corresponding first-order linear differential operator 
is: 

V = ]/"-^. (2.27) 

Notice that the ath component of the vector can be recovered by applying the 
operator to 2;°: 

ya^Y^a_ (2.28) 

As has been mentioned, it is possible to actually identify the vector with its 
corresponding operator. Many mathematics texts actually do this, and it is per- 
fectly permissible since there is an obvious one to one correspondence between 
vectors and first-order linear differential operators by the above equations. In- 
deed, there are numerous advantages to such identification, but we shall continue 
to use the circumfiex to distinguish the operator in order to avoid any ambiguity. 

It is important to note that the operators corresponding to two different vec- 
tor fields do not, in general, commute. Indeed, the commutator of two first-order 
linear differential operators is another first-order linear differential operator. At 
first this may seem surprising because it is not obvious that this commutator is 
a first order operator. By writing it in terms of the components of V and U, 
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however, we see that the second order terms do indeed cancel: 



V,U 




d 



d 



V 



Oi 



d 



) 



(2.29) 



The vector whose operator is the commutator of the operators of two other 
vectors, V and U, is said to be the Lie bracket of those two vectors, and is 
denoted by [V, U]. Note that the Lie bracket operation makes the space of all 
vector fields into a Lie algebra. 

Using the Lie bracket, it is possible to give a simple test that will determine 
whether or not any given set of basis vectors is a coordinate basis: A set of 
n linearly independent vectors constitutes a coordinate basis if and only if the 
Lie bracket of any two elements of the set vanishes. The "only if" part of this 
theorem is obvious, since coordinate basis vectors are partial derivatives and 
these always commute with each other. The converse, however, is a special case 
of something called Frobenius' theorem, and is somewhat harder to see. To 
prove it algebraically, we must show that it is possible to actually construct a 
coordinate system (at least locally) given the n linearly independent commuting 
vectors. We shall not follow this approach here (see Schutz ^3] for details on 
how to prove it this way). Instead, we shall follow a more geometrical line of 
reasoning that will make the theorem almost obvious. To do this, however, we 
first need to learn about the Lie derivative. 

2.2.4 Lie Derivatives 

The Lie derivative of a scalar field, f{z), with respect to the vector field, V, is 
a new scalar field denoted by and is given by: 



Cvf = V/ = y 



a 



df 



(2.30) 



dz"^' 
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This is recognized as the directional derivative of / along the vector field, V. 
Along any given field line of V, it is possible to define a coordinate, A, such that: 

V ^ ^ (2.31) 

so 

and so the Lie derivative of / with respect to V is simply df /dX. That is, we 
evaluate the scalar field at the points z{Xo) and 2;(Ao + SX) along the field line, 
subtract the first value from the second, divide the result by 5 A, and let 5 A go 
to zero to get the Lie derivative. In Fig. 12.11 these two points of evaluation are 
denoted by A and B. 

A scalar field, /, whose Lie derivative with respect to V vanishes is said to 
be a Lie dragged scalar field with respect to the vector field V. Intuitively, this 
means that the scalar field is constant along the field lines of V. Alternatively 
stated, it means that the scalar field satisfies the first-order linear differential 
equation V/ = 0, whose characteristics are the field lines of V. Thus, if the value 
of a Lie dragged scalar field is specified at any one point of a field line of V, its 
value everywhere else on that same field line is determined (it's the same value). 
Using this concept, we can reword our definition of a Lie derivative: Begin by 
evaluating the scalar field / at point A. Next, drag the scalar / at point B back 
to point A to get the scalar /* at A (note f*{A) = f{B)). Now at the point A 
we subtract / from /*, divide the result by 5 A, and let 5 A go to zero to get Cyf- 
This may sound like a fancy way of saying the same thing, but it will aid in our 
efforts to generalize the Lie derivative to act on other things besides scalars. 

Consider the problem of trying to define an analogous derivative that acts on 
contravariant vectors. We could begin by evaluating a vector field, say U, at the 
same two points, ^(Ao) and z{Xo + SX), along a field line of V. Unfortunately, 
however, we cannot subtract them because they live in two different vector 
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Field line of V 




Figure 2.1: Lie Differentiation of a Scalar Field 
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spaces: The first lives in the space of all vectors at the point -z(Ao), while the 
second lives in the space of all vectors at ^(Ao +5A). We are dealing with spaces 
in which there may be no notion of parallel transport, so there is no natural way 
of comparing vectors located at two different points. 

So we must be a little more clever. Refer to Fig. 12.21 Just as we have 
coordinatized a given field line of V by A, we shall use fi to coordinatize a given 
field line of U. The operator U is then d/ dfi. It acts on scalars by evaluating them 
at z{fio) and z{pQ + dfi), subtracting the first value from the second, dividing 
by dfi, and letting go to zero. In Fig. 12.21 these two points of evaluation are 
denoted by A and C; note that we have arranged things in this figure so that 
point A is parametrized by both Aq on the V field line, and fio on the U field 
line. 

Now we can imagine sliding the points A and C along the V field lines for 
an increment 5A, to arrive at the new points B and D, respectively. These new 
points define a new first-order linear differential operator based at the point B. 
It acts on scalars by evaluating them at the points B and D, subtracting the first 
value from the second, dividing by 6fi (it is clear that points B and D coincide 
as 6fi ^ 0), and letting 6fi go to zero. This first-order linear differential operator 
at B corresponds to a vector at point B, and so we see that we have found a 
natural way to drag the vector field U along the vector field V. If a vector field 
U is unchanged by dragging it along V, then it is said to be a Lie dragged vector 
field with respect to V. 

Armed with this insight, we are ready to define the Lie derivative of a vector 
field, U with respect to another vector field, V. We begin by evaluating U at 
point A. Next, we drag the vector U at point B back to point A to get the 
vector U* at A. Now we can subtract U from U*, divide the result by 5A, and 
let dX go to zero to get Cy^- It should be clear from this description that the 
Lie derivative of a Lie dragged vector field vanishes, just as was the case for 
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U (A) dragged to B 




Figure 2.2: Lie Dragging a Vector Field Along Another Vector Field 
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scalar s. 

Now that we have the geometrical picture of what is happening, we need to 
find an analytic expression for £yU. Refer to Fig. 12.31 It is clear that we may 
write: 



and 



{CvVnA) 



Hm 



Hm 



^ct a 



lim 

<5At^0 Oil 



lim 



lim hm — - — ^— ^ 



(2.33) 



where through z^ are the coordinates at the points A through F, respec- 
tively. To find these coordinates, we use Taylor expansion. Thus, to express the 
coordinates of point B in terms of quantities at point A, we write: 



dz' 

A 



z1 + 



dX 



„ 1 d'^z" 
6X + - 



2 dX^ 



6X^ + 



Similarly, the coordinates of point C are given by: 



(2.34) 



zc 



z"{iJo + 6fj,) 
dz 

A 



z1 + 



d^ 



1 



+ 



A 



(2.35) 



Next, the coordinates of point E can be expressed in terms of quantities at point 
B: 



-E — ^B 



Z% + 



dz' 



dH 



Sh + 



1 d^z"^ 



B 



2 dn' 



+ ■ ■ 



(2.36) 



B 



and these in turn may be expressed in terms of quantities at point A: 

dz"^ 



Ze 



z2 + 



dX 



„ 1 d'^z" 
SX + - 



2 rfA^ 



5X^ + 



A 
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Figure 2.3: Lie Differentiation of a Vector Field 
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+ 



^ dXdfx 



5X6ii + 



1 d^z" 
+- 



2 d^' 



(2.37) 



A 



Finally, the coordinates of point F can be expressed in terms of quantities at 
point E, which in turn can be expressed in terms of quantities at point B, which 
in turn can be expressed in terms of quantities at point A: 

E 



dX 

dz" 



„ 1 
+ - 



2 dX^ 



6X^ + 



E 



dix 

dz"^ 



B 



1 d'^z'^ 



+ 



B 



dX 



6X 



d'z 



B 



dfidX 



d/idX — 



B 



1 d'^z'^ 
+ - 



2 dX^ 

dz" 



6X' + - 



B 



+ 
+ 



dX 

dz" 



„ 1 d^z"" 
+ - 



dii 
1 d^z'^ 



2 rfA2 
^ dXdfi 



6X'^ + 



5X5n + ■ 



2 djj,'^ 
dz"^ 



dX 

d'^z" 



A 

6X 



6^' + -- 

d^z" 



dX^ 



6X' 



+ 



dfidX 
1 d^z'^ 



2 dX^ 

dz" 



SfiSX 



5X' + ■ 

A 



dn 



1 



2 dn^ 



5^2 + 



2^a 



+ ■ 



d'z 



dXdn 

Thus, using Eqs. ()2.33p . we find: 

dz"^ 



d'^z" 

^ d\idX 



S^SX + 



A 



U"{A) 



dfx 



(2.38) 
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and 



and so 



U*"{A) 



dfx 

d^z'' 



f d'^z" 



y^'^ \ dXdn 

d^z" 
^ d^dX 



d^z'^ 
^ dfxdX 



5X + 



dXdn 

= ( VU - UV^ z° 



V,U 



A 



(2.39) 



We have just demonstrated that the Lie derivative of U with respect to V is 
simply the Lie bracket of V and U: 



£vU= [V,U] 



In a coordinate basis, this result may be written 



(2.40) 



,/3- 



(2.41) 



Note that this way of writing the result may be taken as valid for a noncoordinate 
basis as well if we reinterpret the commas as meaning "operation by the basis 
vector." That is, denotes the result of applying to / the operator correspond- 
ing to the basis vector Gq,. For a coordinate basis, the operators corresponding 
to basis vectors are simply partial derivatives with respect to the coordinates, 
so this reduces to the usual meaning of the comma. This generalization of what 
the comma means will be useful in everything that follows. 

Now that we know how to take the Lie derivative of a contravariant vector 
field, we shall try to extend this process to covector fields. Recall that covectors 
contract with contravariant vectors to give scalars. We define a Lie dragged cov- 
ector field to be one which when contracted with any Lie dragged contravariant 
vector field yields a Lie dragged scalar field. To take the Lie derivative of a cov- 
ector field a with respect to V, we evaluate a at the points A and B in Fig. 12.11 
drag a(i?) back to A to get a* (A), subtract Sl^A) from a* (A), divide by 5A, and 
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let go to zero. The result is: 

{CvB.)c. = V^a„,p + V^^^ap, (2.42) 

Once again, this result is valid for noncoordinate bases if we generalize the 
meaning of the commas. 

Next we consider the Lie derivative of a general tensor. We first define a Lie 
dragged tensor of type (r, s) as one which yields a Lie dragged scalar field when 
fed r Lie dragged covectors and s Lie dragged vectors. To take the Lie derivative 
of a tensor T of type (r, s) with respect to V, we evaluate T at the points A and 
B in Fig.O drag T{B) back to A to get T*(A), subtract T{A) from T*(A), 
divide by 5 A, and let 5 A go to zero. The result is: 

_T/ai rp'ya2.--OLr _ 

lT/7 rpai...ar. I 

+ ,/3l^7/32.../3. ^ 

+v\T-^::£:^^ (2.43) 

Note that the above geometrical picture for Lie derivatives of general tensors is 
equivalent to the neat coordinate-free algebraic formula 

i:y (T(ai,...,a,,Ui,...,U,)) = (/:yT)(ai, . . . , a„ Ui, . . . , U,) 

+T(/:yai,...,a„Ui,...,U,) + --- 
+T(ai, . . . , CvAr, Ui, . . . , Us) 
+T(ai,...,a„£yUi,...,U,) + --- 
+T(ai,...,a,,Ui,...,£v^U,).(2.44) 

Finally, it is straightforward to show that Lie derivatives obey the Leibniz 
rule over the tensor product. That is 

CviTi ® T2) = (CvTi) ® T2 + Ti ® iCvT2). (2.45) 
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Before leaving this subsection, it is important to emphasize that the same 
geometrical notions that led us to the Lie derivative of a vector field still apply for 
arbitrary tensors: The Lie derivative is the natural way to "drag" any tensorial 
object along the field lines of a vector field. Just as we dragged the vector U 
along the field line of the vector V for a parameter interval 6X to get the vector 



(see Eq. fj2.33p ). so we can drag the tensor T in exactly the same way to get 



This geometrical insight is crucial to the understanding of Lie transforms. 

2.2.5 Examples of Coordinate and Noncoordinate Bases 

We are now in a position to understand the theorem presented at the end of 
Section 12.2.31 from a geometrical point of view. Fig. 12.31 and Eq. ()2.33|] make 
it clear that the Lie bracket of two vector fields is related to the infinitesimal 
difference in position resulting from the operation of moving along the first 
vector field for a certain parameter interval, then along the second vector field, 
then backwards for the same parameter interval along the first, then backwards 
along the second. Clearly, if the vector fields involved are basis elements of a 
coordinate basis, this operation will simply take one around a square right back 
to the original position. The sides of the square are the contours of constant 
values of the two coordinates involved. Conversely, if two members of a set of n 
linearly independent vectors have nonvanishing Lie bracket, then it is impossible 
to construct a coordinate system that has those vectors as a basis because moving 
around the above-described infinitesimal loop does not return one to the starting 
point; the changing parameters do not "hook together" in the manner necessary 
for them to be coordinates. 



U* = U + (£yU)5A 



(2.46) 



T* = T + (£x/T)5A. 



(2.47) 
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Part of the reason that this concept of coordinate and noncoordinate bases 
is tricky is that there is no need for such a distinction in Cartesian coordinates. 
There, the coordinate basis is identical to the usual orthonormal basis 

^ 5a- (2.48) 



A good example of a familiar situation for which the distinction is important is 
that of polar coordinates in two dimensions. The usual polar unit vectors, f and 
9, are not a coordinate basis since 



-. (2.49) 

r 



On the other hand, d/dr and d/d9 do constitute a valid coordinate basis, and 
these are related to the above orthonormal basis by 

d_ _ . 
dr 

and 

§6 = ^^-^^^ 

The important point is that there are no pair of coordinates, ^ and 77, such 
that f = d/d^ and 9 = d/dr]. Geometrically, this is because if we traverse an 
infinitesimal loop following first the f vector field and then the 9 vector field 
(and then returning along them, respectively) we will not arrive at our starting 
point (see Fig. I2.4jl . The factor of r on the right hand side of the second of 
Eqs. fl2.5()|l corrects for this and gives us a coordinate basis. 

As mentioned previously, the above distinction also holds for covectors. To 
pursue the above example, the covector basis consisting of dr and d9 is dual to 
the vector basis consisting of d/dr and d/d9. It follows that the covector basis 



and 



r = dr 
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Figure 2.4: Polar Coordinate Unit Vectors 
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e 



rde 



(2.51) 



is dual to the vector basis f and 9. Once again, there is no pair of coordinates, 
^ and ?7, such that r = and 9 = drj. 



An s-form is defined as a tensor of type (0, s) that is antisymmetric in every pair 
of its s vector arguments. In particular, a zero form is a scalar and a one form 
is a covector; a two form, O, obeys 



etc. It follows that the components of an s-form are antisymmetric under in- 
terchange of any pair of indices. In particular, this means that the s indices 
must all be different, or else the component will vanish. Hence the requirement 
of antisymmetry means that there are no longer independent components. 
Instead, a standard combinatorial argument shows that only 



of the components are truly independent. This means, among other things, that 
there are no nontrivial s-forms in an n-dimensional space if s > n, that an n- 
form has only one nontrivial component, etc. The total number of independent 
components of all forms in a space of dimension n is thus 



Note that s-forms inherit some properties from the fact that they are tensors 
of type (0, s). In particular, two s-forms may be added to get a third s-form. 
Thus, the set of all forms at a point in an n-dimensional manifold may be thought 



2.2.6 Differential Forms 




(2.52) 




(2.53) 




(2.54) 
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of as a 2'^-dimensional vector space, although it is somewhat strange in that its 
elements may be added if and only if they are both s-forms for some s. Note, 
however, that this space is not an algebra under the tensor product operation 
because it is not closed under that operation: The tensor product of two forms 
is not necessarily a form. If we take the tensor product of an si-form with an 
S2-form, we get a tensor of type (0, si + S2) that is clearly antisymmetric under 
interchange of any two of its first si or last S2 arguments, but is not necessarily 
antisymmetric under interchange of one of its first si components with one of 
its last S2 components. 

2.2.7 The Wedge Product, the Interior Product, Dual 
Tensors 

It would thus be nice to define a product under which the set of all forms 
becomes a closed algebra. Such a product is called the wedge product, and is 
denoted by the symbol A. We motivate its definition as follows: The wedge 
product of a scalar (zero form) with any s-form is the s-form obtained by simple 
multiplication by the scalar. The wedge product of two one forms, and a^, is 
the two form given by 



It is clear that the two form thus obtained is antisymmetric. For three or more 
one forms, we demand that the wedge product be associative, so, for example 



= a^ ® a^ ® a^ + a^ ® a^ ® a^ + a^ (g) a^ ® a^ 

-a^ ® a^ ® a^ - a^ (g) a^ ® a^ - a^ ® a^ ® a^. (2.56) 

Recall that the total number of independent components of an s-form in a 



a^ A a^ = a^ (g) a^ - a^ (g) a^ 



(2.55) 



a^ A a^ A a^ 



a^ A (a^ A a^) 
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space of dimension n is given by | | • A moment's thought convinces one 

(n 
I independent 

basis s-forms 

Cj^' A . . . A (2.57) 

where the {3 indices range from 1 to ti, and must all be different (else the above 
expression will vanish). It is then straightforward to see that an arbitrary s-form, 
O, is given by 

n= ^np^,„p^Cjf^^ A...ACjf^\ (2.58) 

Note that the factor of s! appears here because we did not put it into the 
definition of the wedge product; we could have done it either way, and authors 
differ in this convention. 

Now that we know how the wedge product operates on scalars and one forms, 
we can extend its definition to arbitrary forms by writing them in terms of wedge 
products of basis one forms, as shown above. This makes the 2"^-dimensional 
vector space of all forms into an algebra, called a Grassmann algebra. Note that 
it is not a commutative algebra: If and are si and S2-forms, respectively, 
then 

Afi2 = (-1)''''J^2 Afii. (2.59) 

If we contract the first index of an s-form (where s > 1), O, with a vector, 
V, then it is straightforward to see that we get an [s — l)-form. We call this 
new form the interior product of O with V, and we denote it by iy^- Thus 

iv^= , ^ V^^^p,...p^u^^ A...Au^^. (2.60) 

If 0} and are si and S2-forms, respectively, then it is straightforward to show 

zy(0^ A O^) = {iyVL^) A + (-l)^ioi A (zy02)_ (2.6I) 
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f n \ 


: 




\ n — s j 



Also, the antisymmetry of forms makes it clear that 

iyiyn = (2.62) 

for any n-form O with n > 2. 

Recall that we defined an s-form as a completely antisymmetric tensor of type 
(0, s). Note that we could have done the same thing for completely antisymmetric 
tensors of type (s, 0). Next note that a completely antisymmetric tensor of type 
(0, s) has exactly the same number of components as a completely antisymmetric 
tensor of type (n — s, 0) in a space of dimension n > s. This is because 

(2.63) 

This suggests that there may be a one-to-one correspondence between s-forms 
and completely antisymmetric tensors of type (n — s, 0). 

For example, note that there is only one independent component of a com- 
pletely antisymmetric tensor of type (n, 0) . This is because the components of 
such a tensor must be proportional to those of the Levi-Civita symbol, e/^i - /^" . 
The proportionality constant is a scalar (zero form). Similarly, we can put any 
scalar (zero form) in front of the Levi-Civita symbol, and obtain the components 
of a completely antisymmetric tensor of type (n, 0). Thus, there is a one-to-one 
correspondence between scalars (zero forms) and completely antisymmetric ten- 
sors of type (n, 0). 

More generally, we can use the Levi-Civita symbol to obtain a one-to-one 
correspondence between s-forms and completely antisymmetric tensors of type 
(n — s, 0) as follows: 

7^^"-^"-^ = ^e^^-^"0^„_.+,.../3„, (2.64) 

and 
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Here we have used the easily verified relation 

ef'^-f'-ep,...p^=nl (2.66) 

Referring to Eq. (|2.64p , we say that T is dual to O with respect to e. This is often 
abbreviated T = *f2. Referring to Eq. ()2.65|1 . we say that O is dual to T with 
respect to e, or O = *T. Note that for any form, O, we have **0 = (— l)'*('^~*)r2. 

2.2.8 The Exterior Derivative and the Homotopy Formula 

We now define a differential operator, d, that converts s-forms into (s + l)-forms. 
This operator is defined as follows: When applied to a scalar (zero form), /, it 
yields the one form, df, such that 

df{V) = Vf. (2.67) 

Thus, in a coordinate basis, z", we have 

df = §^dz-. (2.68) 

Next, we demand that the operator be linear, so if O and A are two s-forms 
then 

d{n + A) = dn + dA. (2.69) 
Next, we demand that if f2i is an si-form and f22 is an S2-form, 

d(fii A O2) = dfli A fi2 + (-l)^'Oi A rf02- (2.70) 

Finally, we demand that for any s-form, f2, we have 

ddn = 0. (2.71) 

The above demands define the operator d uniquely and unambiguously. We can 
apply the exterior derivative to an arbitrary form by first expanding it in terms 
of wedge products of basis one forms, and then applying the above rules. 
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In terms of components in a coordinate basis, the exterior derivative of a 
scalar is 

(rf/)a = /,a, (2.72) 

and the exterior derivative of a one form is 

{da)c,f3 = ap^a - aa,f3- (2.73) 

More generaUy, 

(-1)^ 

(<iO)Q,^. = -^e/3,.../3^+iOa^^...a^^,a^ ■ (2.74) 

A form whose exterior derivative vanishes is said to be closed. A form that 
is the exterior derivative of another form is said to be exact. Clearly, any exact 
form is closed. The interesting question is whether or not any closed form is 
exact. The answer to this depends on the global topology of the manifold on 
which the closed form lives. Locally, it is always true. 

There is a marvelous relationship between Lie derivatives, interior products, 
and exterior derivatives. It is possible to prove that 

Cv^ = ivd^ + div^ (2.75) 

for any n-form, with n > 1, and any vector field, V. This relationship is called 
the homotopy formula. The proof usually given (see for example Section 4.20 of 
Schutz [^) proceeds by induction: It is first proved for a one- form, and then it 
is shown that it works for an 7i-form if it works for an (n — l)-form. 

The generalized homotopy formula, 

c^n = {ivdyn + {divyn (2.76) 

for J > 1, is proved by induction as follows: First note that it reduces to the 
ordinary homotopy formula when j = 1. Next, assume that it is true for j = I. 
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Then 

C^+^n = {ivd + div)[{ivdyn + {divyn] 

= {ivdY+^n + idivY+^n, (2.77) 

where we have used the fact that apphcation of dd or iyiv causes any form to 
vanish. Note that the generahzed homotopy formula is not true for j = 0. 

FinaUy, we can show that Lie derivatives commute with exterior derivatives. 
This is done as follows: 

dCv = diiyd + div) = diyd = {iyd + div)d = Cyd, (2.78) 

where we have used the homotopy formula. 

2.2.9 Integration on Manifolds 

Differential s-forms can also be introduced as integrands of s-dimensional inte- 
grals. See Flanders JHl for more on this approach. Adopting this point of view, 
it is possible to prove the generalized Stokes' theorem 

I dn= I O, (2.79) 

Ju JdU 

where U is an (s + 1) -dimensional volume, and dU is the s-dimensional surface 
that bounds it. 

We shall not attempt to prove the generalized Stokes' theorem here (see 
Schutz ^3] for a good presentation), but we shall make it plausible by show- 
ing how it reduces to the familiar Stokes' theorem and divergence theorem of 
three dimensional vector calculus. In three dimensional Euclidean space, with 
Cartesian coordinates, the gradient is given in our notation by 

iVfh = {df),, (2.80) 

the divergence is given by 

V-V = *d*V, (2.81) 
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and the curl is given by 

V X V = *rfv, (2.82) 

where v is the one form whose Cartesian components are identical to those of 
the vector V. Note that 

V X V/ = *ddf = (2.83) 

and 

V ■ (V X V) = *d**dv = *ddv = (2.84) 
both follow from dd = 0. Then 



and 



V ■ Vrfx'^ = / d*V = *\= W -da, (2.85) 

U Ju JdU JdU 



\/xV-da= dv= v = / Y-de. (2.86) 

U Ju JdU JdU 



Thus we see that our formalism is the natural generalization of three dimensional 
vector calculus to manifolds of arbitrary dimension. 

2.2.10 Metric Spaces 

The usual dot product of linear algebra is a rule for taking two vectors, say U and 
V, and associating with them a real number, denoted U ■ V. The result depends 
bilinearly on the two vectors involved, so we see that there is a tensor of type 
(0, 2) at work here. Furthermore, the dot product is required to be commutative, 
so the tensor must be symmetric. Denoting this tensor by g, we have 

U-V = g(U,V). (2.87) 



This tensor is called the metric tensor. If we also demand that it have an inverse, 
then we can find a basis for which it has diagonal form with entries equal to ±1 
(if all the diagonal entries can be made equal to +1, then we say that the metric 
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is definite, otherwise we say that it is indefinite). The trace of the metric in this 
canonical diagonal form is called its signature. 

A metric tensor field is the association of such a type (0, 2) symmetric tensor 
with every point of a manifold. It must have an inverse at every point. It follows 
that the signature is the same at every point of the manifold. 

A manifold endowed with a metric has all sorts of new structure. For the 
purposes of our discussion, its most important role is to provide a one-to-one 
correspondence between vectors and covectors. For, given any vector, say V, we 
can form the covector, g(V, ). The components of this new covector are then 
gcepV^ . Denote the inverse of Qap by g^'^, so 

9c.p9^^ = Sl. (2.88) 

Then, given any covector, say a, we can form the vector with components g^'^afj. 
Note that this is a one to one correspondence. 

Frequently we shall use the same symbol to denote a vector and its corre- 
sponding covector in a metric space. That is, we may write 

K = gapV, (2.89) 

or 

_ g^PVp. (2.90) 

This process is called index raising or index lowering, as the case may be. It can 
be used to raise or lower the indices of any tensor of any type. 

We shall frequently abuse notation by using the dot product to denote the 
interior product of a vector with a covector. That is, we may write 

a- V = a(V) =aa"l/". (2.91) 

When this is done, it will be obvious from context, so no confusion should arise. 
We shall occasionally further abuse notation by using a "double dot" notation 
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for two contracted indices. That is, given two tensors of type (0,2) and (2,0), 
respectively, we may write 



Once again, things should be clear from context. 

A metric tensor field does far more than provide an invertible map from 
vectors to covectors. It also induces on the manifold something called an affine 
connection. This makes it possible to compare vectors in nearby tangent spaces. 
Recall that Lie dragging gave us a way to do this, but there had to be a vector 
field present in the first place along which to drag, and we could drag only in 
the direction of that field. An affine connection allows us to parallel transport 
vectors from one tangent space to any other one nearby; that is, it gives us a 
notion of parallelism between vectors in different tangent spaces. Furthermore, 
it does not require the presence of any vector field there to begin with. One 
does not need a metric to have an affine connection, but the presence of a metric 
induces an affine connection in a natural way. 

Armed with an affine connection, it is possible to go on to define such things 
as curvature and torsion. While knowledge of this material is certainly helpful in 
understanding the material presented in this thesis (especially the curvature and 
polarization guiding-center drifts and the intimate relationship between torsion 
and spin angular momentum), it is not essential. Thus we shall not go on to 
discuss these topics; the interested reader is referred to Schutz [1^] for a good 
introduction, and to Misner, Thorne and Wheeler [201 or Chandrasekhar for 
a more detailed presentation. 



F : G = F^/^C"^. 



(2.92) 
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2.3 Noncanonical Hamiltonian and 
Lagrangian Mechanics 

2.3.1 Canonical Versus Noncanonical Coordinates 

In elementary classical mechanics courses, Hamiltonian mechanics is derived by 
application of a Legendre transformation to the system Lagrangian. This pro- 
cess gives rise to canonical coordinates in a very natural way. When it becomes 
necessary to change coordinates on phase space, the student is taught to restrict 
attention to the limited class of transformations that will maintain this separa- 
tion of the coordinates into canonically conjugate pairs; these are the so-called 
canonical transformations. 

The Poisson bracket of two scalar phase functions, A and B, is then intro- 
duced by defining it in terms of partial derivatives with respect to the canonically 
conjugate pairs of coordinates, q'^ and pi (the index i ranges over all the degrees 
of freedom): 

SABl-——-— — 
dq^ dpi dpi dq^ ' 

where we have adopted the convention of summation over repeated indices. It 
is then shown that this bracket is bilinear: 

{xA + yB,C} = x{A,C} -^y{B,C} (2.94) 

where x and y are constants, that it is antisymmetric: 

{A,B} = -{B,A}, (2.95) 

that it obeys the Jacobi identity: 

{A, {S, C}} + {C, {A B}} + {S, {C, A}} = 0, (2.96) 

and that it obeys the chain rule: 

{f{A),C} = nA){A,C} (2.97) 
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or, equivalently, the Leibniz product rule: 



{AB, C} = A{B, C} + B{A, C}. 



(2.98) 



Mathematicians have a different way of looking at all of this. In mathemat- 
ics courses on Hamiltonian mechanics, one is more likely to define the Poisson 
bracket as any rule for taking a pair of scalar phase functions and associating 
with them a third scalar phase function consistent with the properties listed in 
Eqs. fj2.94|) through fj2.98|l above. Now it is manifest that any Poisson bracket 
given by the physicists' definition is also a Poisson bracket according to the 
mathematicians' definition. The converse, however, is not true; that is, there 
exist Poisson brackets that obey all of the above-listed properties, but are not 
given by Eq. fj2.93|l for any set of canonical coordinates, q and p. Thus, by adopt- 
ing the mathematicians' definition, we can generalize what is meant by a Poisson 
bracket in a very powerful way. 

To see how this comes about, let us take the mathematicians' viewpoint and 
suppose that we have a phase space with coordinates, z", where a ranges from 
1 to A^. For canonical coordinates, N is twice the number of degrees of freedom 
and the are the q's and p's, but let us not restrict ourselves to this special 
case in any way; in particular, N could be an odd number, and there need not 
be any natural pairing amongst the coordinates. 

Denote the Poisson bracket of coordinate with coordinate z^ by: 



Suppose that we changed our phase space coordinates, z i— z'. Then, using 
the chain rule, Eq. ()2.97jl . we see that the Poisson bracket of two of the new 
coordinates is given by: 



J' 




(2.99) 



J 



r /Pi 
{Z ,Z } 



dz' 



dz'^ 




dz^ 
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or 



This makes it clear that the J"^ are the components of a second rank contravari- 
ant tensor. This tensor wiU henceforth be caUed the Poisson tensor. Using the 
chain rule once again, we see that the Poisson bracket of any two phase functions, 
A and B, may be written in terms of the Poisson tensor as follows: 

The general form of the bracket given by Eq. fj2.1U2|l is clearly bilinear and 
obeys the chain rule (or, equivalently, the Leibniz product rule). Now, the other 
two defining properties of the Poisson bracket may be expressed as properties 
of the Poisson tensor. It is easily seen that antisymmetry of the bracket implies 
and is implied by antisymmetry of the Poisson tensor: 

= _j/5a_ (2.103) 

Somewhat more algebra shows that the Jacobi property of the bracket implies 
and is implied by the following property of the Poisson tensor: 

ja^ + J^O"^^^ + J^^ J^"^ = 0, (2.104) 

where the commas denote partial differentiation. Thus, our philosophy shall 
be that any tensor that has these two properties defines a perfectly legitimate 
Poisson bracket according to Eq. ()2.102|) . 

Let us see how this works for canonical coordinates, and pi, where i ranges 
from 1 to the number of degrees of freedom, /. Write = for a = 1, . . . , /, 
and z" = Pa- 1 for a = I + 1, . . . , N where = 21. Now canonical coordinates 
have the bracket relations, {q\q-'} = {Pi,Pj} = and {q^,Pj} = — {Pj,?*} = 5*, 
so the matrix of components of the Poisson tensor is: 

{z,z}= I ° ^ I , (2.105) 
1 
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where and 1 are the I x I nuU and unit matrices, respectively. 

Using this Poisson tensor in Eq. fj2.1U2|) . we easily recover the usual expres- 
sion for the canonical bracket, Eq. ()2.93jl . Furthermore, this Poisson tensor 
is obviously antisymmetric, and it obeys Eq. ()2.1U4p since its components are 
constants so their derivatives are all zero. 

If we start with canonical coordinates, then a canonical transformation is 
any transformation that leaves the Poisson tensor unchanged. If we denote the 
Jacobian matrix of the transformation by: 

M = dz'/dz, (2.106) 

then this condition may be written as the matrix equation: 

J = MJM^, (2.107) 

where the superscript "T" denotes "transpose," and J is the canonical Poisson 
tensor given by Eq. ()2.105p . In what follows, we shall generalize the term canon- 
ical transformation to mean any bracket-preserving transformation, regardless 
of whether or not we started from canonical coordinates. 

Thus far, we have said nothing about the equations of motion. For canonical 
coordinates these are well known to be: 

dH 

OPi 

and 

p. = (2.108) 

where H is the Hamiltonian. These may be written in terms of the Poisson 
bracket as follows: 



and 



= {q\H}, 
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= {p,,H}. (2.109) 
If we use z to refer to the q's and p's, this becomes even simpler to write: 

i° = {;z°,if}. (2.110) 
Alternatively, this may be written in terms of the Poisson tensor: 

= J-^^. (2.111) 

Since this last equation is in tensor form, and since it is known to hold for canon- 
ical coordinates, it must be the correct generalization of the equation of motion 
for noncanonical coordinates. Thus, the complete specification of a Hamiltonian 
system in this new generalized sense requires the specification of both a Poisson 
tensor and a scalar Hamiltonian. 

Any dynamical system on phase space can be expressed in the form i° = 1/°, 
where V is some vector field on the phase space. Eq. fj2.111|) for a Hamiltonian 
dynamical system has this form. Note, however, that in order to qualify as 
"Hamiltonian," the vector field on the right cannot be just any vector field; 
it must be given by the Poisson tensor contracted with the gradient of some 
scalar function. A vector field on phase space is called a Hamiltonian vector 
field if there exists some scalar field for which this is true. Thus, if a manifold is 
endowed with a Poisson tensor, then scalar fields generate Hamiltonian vector 
fields. 



2.3.2 An Example of a Noncanonical Poisson Structure 

There are several ways that noncanonical Poisson structures can arise in a prob- 
lem. The first and most obvious way is to start with canonical coordinates and 
make a noncanonical transformation. The canonical Poisson tensor is known to 
obey Eqs. fj2.103|l and ()2.104|) . and since these are tensorial equations they will 
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hold in all frames if they hold in any one frame. So the result of a noncanonical 
transformation will be a new bracket that obeys all the required properties. 

A particularly beautiful example of this has been given by Littlejohn [221 for 
the problem of a charged particle in a magnetic field. For canonical coordinates, 
q and p, the Hamiltonian is well known to be: 

^^=7^fp--A(q))', (2.112) 



2m \ c 

where A(q) is the vector potential. Make the noncanonical transformation to 
new coordinates, r and v, where: 

r = q 

and 

V = -(p--A(q)). (2.113) 

m \ c / 

The bracket relations among the new coordinates are easily calculated: 

{r,r} = 0, 

{r,v} = -1, 

m 



and 



{v,v} = i-O, (2.114) 



where we have defined the matrix f2 with components: 

^ij = ~ ("^i,* ~ ^hj) = 'T^n^''^^^'^ ' (2.115) 

and where the are the components of the ordinary magnetic field pseudovec- 

tor. Thus the bracket of any two scalar phase functions, R and S", is given 
by: 

. i (I? . 1^ - §5 . f ) ^ . (§5 X . (2.116) 

m \ or ov ov or I m'^c \ ov ov / 
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This bracket is easily seen to be antisymmetric. That it satisfies the Jacobi 
identity is less obvious; we know that it must from the arguments given above, 
but a direct proof involves some tedious algebra. The new Hamiltonian is simply: 



and it is readily verified that this Hamiltonian, together with the bracket given in 
Eq. (|2.116|1 yield the correct equations of motion. Note that the vector potential 
is absent from the new formulation; this is construed as an advantage, since the 
vector potential is a gauge-dependent quantity. The above Hamiltonian system 
was the starting point for Littlejohn's work on guiding-center theory |22j . 

Now that we have seen how noncanonical Poisson structures can arise from 
noncanonical transformations of a canonical system, it is natural to ask the op- 
posite question: Given a noncanonical Hamiltonian system, is it always possible 
to find a transformation to canonical coordinates? For noncanonical Hamilto- 
nian systems with a nonsingular Poisson tensor (that is, systems for which the 
matrix of components of the Poisson tensor is nonsingular), there is an impor- 
tant theorem, called Darboux's theorem, that tells us that the answer is "yes." 
A proof of Darboux's theorem is given by Littlejohn and is constructive; 
that is, it gives a prescription for actually finding the transformation to canoni- 
cal coordinates. For Hamiltonian systems with singular Poisson structures, the 
situation is more complicated, and will be discussed shortly. 



Reduction and Noether's Theorem 

Noncanonical transformations from canonical coordinates is only one of many 
ways that interesting Poisson structures can arise naturally. The process of "re- 
duction" of a Hamiltonian system with symmetry is another. Work in this area 
has been pioneered by Marsden and Weinstein (see, for example, reference |23j). 





2.3.3 Reduction 
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A detailed discussion of reduction would be out of place in this work, but 
the general idea is this: Suppose that we have a canonical Hamiltonian system 
with a configuration space symmetry (e.g. spatial translation, rotation, etc.). 
Make the configuration space symmetry group parameter one of the generalized 
coordinates. Noether's theorem then tells us that the corresponding momentum 
is conserved. It is then possible to eliminate this degree of freedom from the sys- 
tem, thus reducing the dimensionality of the phase space by two. This much is 
familiar from elementary courses in classical mechanics. Reduction is an impor- 
tant generalization of Noether's theorem that allows us to similarly "mod out" 
by a symmetry group that acts on all of phase space rather than just configura- 
tion space. After reduction is performed, the resulting Hamiltonian system may 
very well be noncanonical. 

The set of all phase functions together with the Poisson bracket operation 
constitutes a Lie algebra. From a computational point of view, in order to 
perform reduction we must find a representation for which this Lie algebra has 
a closed Lie subalgebra. Furthermore, the Hamiltonian must depend only on 
the elements of this subalgebra. The elements of the subalgebra then constitute 
coordinates for a reduced description of the problem. This is best illustrated by 
example. 

The Free Rigid Body 

One of the most elementary (but nontrivial) examples of this process is the 
Hamiltonian system for a free rigid body. The usual generalized coordinates for 
this problem are the Eulerian angles, 9, (p, and i/^, with respect to some fixed 
space frame. By introducing their canonically conjugate momenta, pe, P(j,, and 
p^, it is possible to write the equations of motion in a canonical Hamiltonian 
format with a six-dimensional phase space. If we choose a body frame for which 
the inertia tensor is diagonalized, then the Hamiltonian for the free rigid body 
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problem is 

1 

H = — + — [{p^cscO -p^cot6)cos-ii; -pesiii-ip]^ 

Zls ZI2 

1 2 

+ — [{P(l, CSC 9 - p^ cot 9) sin ifj + pq cos ifj] , (2.118) 

where /i, I2, and Is are the three diagonal elements of the inertia tensor. 

Consider the three components of the angular momentum resolved in the 
body frame. These can be expressed in terms of our canonical phase space 
coordinates as follows: 

"^1 = {P(f, CSC 9 — p.ip cot 9) sin i/j + pe cos ifj, 

= {pci, CSC 9 — p^ cot 9) cos i/j — pe sin i/j, 

and 

ms = p^. (2.119) 

(See Goldstein for details. Only the result is needed here.) 

By direct calculation with the canonical bracket, we can verify the following 
relations 

{1711,1712} = -1713 

{7712, ms} = -mi 

{m3,mi} = -1712. (2.120) 

Thus, the three components of the angular momentum in the body frame con- 
stitute a closed Lie subalgebra under the operation of the canonical Poisson 
bracket. This means that the subset of functions on the canonical phase space 
that are functions of the m's alone (that is, those functions that depend on 9, (p, 
V'l P9i Pct>i ^^'^ Ptl> through their dependence on the m's) constitutes a Lie 
subalgebra of the Lie algebra of all canonical phase functions. 

We thus adopt the m's as generalized coordinates on a reduced phase space 
of three dimensions. The Poisson tensor on this reduced phase space is then 
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given by J"^^ = —e°'^^mj, or: 

^ —nis m2 ^ 

7713 —mi , (2.121) 

so that the Poisson bracket of any two functions of m, say A and B, is given by: 

M.B} = -m.(|^x|£). (2.122) 

\ dm dm ) 

This bracket must satisfy all the required properties of a Poisson bracket, since 
it was derived by specializing the domain of a canonical bracket; nevertheless, it 
is straightforward and instructive to verify this by direct calculation. 

It is possible to perform reduction only if the Hamiltonian is expressible in 
terms of the reduced coordinate set. For the free rigid body, we have 

i?(m) = ^ + !^ + !^. (2.123) 
^ ^ 2/i 2/2 2/3 ^ ^ 

As usual, the equations of motion are given by lii = {m, if}, or: 

mi = — m2m3 



1 


1 


h 




1 


1 


7r 




1 


1 


T2 





m2 = Y j ^smi 

rhs = (^Y^ 7~) mim2. (2.124) 

As expected, these are indeed Euler's equations for the free rigid body. If the 
rigid body were not free (say, if it were in a gravitational field), then a potential 
energy term would have been present in the Hamiltonian, and that term would 
not have been expressible in terms of the m's. Thus, the reduction process would 
have failed. This is because the gravitational field breaks the SO (3) symmetry 
that makes the reduction possible. 

As we shall see later on in this thesis, the passage from particle coordinates 
to guiding-center coordinates is another example of reduction. The symmetry 
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involved is the group of rotations by the gyroangle, S0(2), and the reduction 
ehminates the corresponding degree of freedom from the system. If this gyrosym- 
metry is somehow broken (say, by a variation in the background field configura- 
tion whose length scale is on the order of a gyroradius) , then the guiding-center 
description is invalidated. 

Euler's Fluid Equations 

Our next example is a Hamiltonian field theory for Euler's equations for the 
fiow of an inviscid, incompressible fiuid. Let us adopt a Lagrangian description 
for such a fiuid wherein each fiuid particle is labelled by a reference position, 
xq. Then the configuration of the fiuid at time t may be specified by giving the 
particle's current position, x as a function of xq and t. Thus, our dynamical field 
variable is x(xo, t). The system Lagrangian consists solely of the kinetic energy 

L = J d^xo^x^^o.t), (2.125) 

where p is the constant uniform mass density. The canonical momentum field is 
then given by 

SL 

p(xo, t) = — = pi(xo, t), (2.126) 

dx(xo,t) 

where the 5's denote functional differentiation. Performing the Legendre trans- 
formation, we see that the system Hamiltonian is 

H = j rf=^xo^p2(xo,t). (2.127) 

The canonical bracket of two functionals of x and p, say A and S, is then 

{A,B} = / d'xo ■ ^- - ■ . (2.128) 

J \dx{xo,t) dp(xo,t) dp(xo,t) dx(xo,t)y 

Now suppose that the fiuid particles are identical. In that case, specification 
of x(xo,t) is far more information than is really necessary to determine the 
configuration of the fiuid. This is because x(xo,t) effectively keeps track of 
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particle labels; two configurations that differ only by swapping identical particles 
will actually have different x(xo, t). For a fluid of identical particles, an Eulerian 
description, wherein the flow velocity is given as a function of spatial position and 
time, say v(^,t), suffices to determine the fluid configuration. The Lagrangian 
description just keeps track of too much information. Thus, in passing from 
the Lagrangian to the Eulerian description, we are effectively reducing by the 
group of identical particle interchanges. The Eulerian description is therefore 
the reduced description. The reduced phase space is the (smaller, though still 
infinite dimensional) space of all divergenceless vector fields, v, that satisfy the 
boundary conditions (v tangential to the boundary). The requirement that 
divv = stems from the fact that we are considering only incompressible flows. 

So, from a computational point of view, how do we perform this reduction? 
Note that the Eulerian velocity field may be written in terms of the Lagrangian 
fields as follows: 

v(^,t) = -p(x-i(^,t),t). (2.129) 
P 

This may be interpreted as follows: If we want the Eulerian velocity at spatial 
point ^, first take x~^(^,t) to get the reference position of the fluid element 
currently at ^, then evaluate the momentum p of the fluid element with this 
reference position, then divide the result by p to get the desired answer. Now 
the above equation may be written 

v(e, t) = - [ rf3xop(xo, t)5(x(xo, t) - 0, (2.130) 
P J 

where we have used the fact that the Jacobian, |9x/9xo|, is equal to unity 
because the flow is incompressible. Thus we have succeeded in expressing the 
reduced field variable, v, in terms of the canonical field variables, x and p. In 
this respect, Eq. ()2.13()jl is the exact analog of Eqs. ()2.119|1 for the free rigid 
body problem. 

Thus, we can take the Poisson bracket of the Eulerian field with itself using 
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the canonical bracket. This is straightforward, and the result is 

{v(e, t), v(r, t)} = - (v(r, t)5'{e - - - r)v(e, t)) , (2.131) 
P 

where S' denotes the gradient of the delta function. Note that we have been able 
to express this bracket in terms of the Eulerian (reduced) field variables alone. 
This equation is thus the analog of Eqs. (|2.12U|1 for the free rigid body problem. 

So we see that the functionals of the Eulerian field variables constitute a 
closed Lie subalgebra of the Lie algebra of all phase functionals. We thus adopt 
the Eulerian field variables as coordinates on a reduced phase space. The Poisson 
bracket of any two functionals of v, say A and B, is then calculated by the Leibniz 
rule 

6A 5B 



{A,B} = ld'^ld'e^^yMU)Me,t)} 



5v(e,t) 

5A 6B 



^2.132) 



.5v(^,t)'5v(^,t)_ 

where the square brackets are Lie brackets, and where the functional derivatives 
dA/dv{^,t) and SB /dv{^,t) are regarded as vector fields. 

Note that Eq. (|2.132j) is the analog of Eq. (12.1 22j) for the free rigid body 
problem. 

We must also check that the Hamiltonian may be expressed in terms of the 
reduced variables. Fortunately, this is not difficult. A change of variables in 
Eq. pT27j) gives 

H = ^ [ d'^v^iU). (2.133) 



2 „ 

where we have again made use of the fact that the Jacobian, |5x/5xo|, is equal 
to unity. 

It remains to check that the Hamiltonian in Eq. ()2.133|) together with the 
bracket in Eq. (|2.132jl actually yield Euler's fluid equations. This is slightly 
tricky. Consider a functional A(y). Its equation of motion is 

^^{A,H}. (2.134) 
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We insert Eq. fj2.133|l for the Hamiltonian. After some straightforward manipu- 
lation, including an integration by parts where the surface term vanishes due to 
the boundary condition, we get 



- ' ' ;2.135) 



At this point, we might be tempted to set the expression in square brackets 
above equal to zero on the grounds that A is an arbitrary functional. This 
would, however, be incorrect because bAjb'w is not really arbitrary. Recall that 
our phase space consists only of those vector fields that have zero divergence. 
This causes an ambiguity in the usual definition of the functional derivative 
which is such that the equation 

A{y + 5v) = A{y) + y Si^N ■ ^ + 0{6v^) (2.136) 

is satisfied. If v and v + 5v are both divergenceless, it follows that 5v is di- 
vergenceless. This means that the gradient of an arbitrary function, may be 
added to 5A/5v, since 

y (i^^^v . v</> = - y d^i<^\I ■ (5v) = 0. (2.137) 

We can make the definition of the functional derivative unique by demanding 
that V ■ (5A/5v) = 0. This gives a well-posed problem for the determination of 
0. 

Now, in order to incorporate this constraint that V ■ (5A/5v) = 0, note that 
if we were to add the gradient of any scalar function, i/j, to the expression in 
square brackets in Eq. fj2.135p . the equation would still hold because 



rf'e^- V^ = -y ci'e^V- ) =0. (2.138) 



So the most that we can write is 



^+v-Vv + vfy + V')=0. (2.139) 
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We now identify the pressure 



(2.140) 



so we finally arrive at Euler's fiuid equation 




(2.141) 



Finally, we note that the pressure is not really arbitrary, but is rather determined 
by taking the divergence of both sides of Eq. ()2.141|1 to get 



and by dotting both sides of Eq. fj2.141|) with the unit normal to the boundary 
surface, n, to get 



This constitutes a well-posed Neumann problem for p as a functional of v. Thus, 
Eq. ()2.141|) , coupled with the constraint of incompressibility, determines both v 
and p. 

It is intriguing that the equations of motion for both examples considered 
thus far are named after Euler; one wonders if he knew about the beautiful 
analogy between them. In fact, the first published reference to this analogy 
seems to be a 1966 paper of Arnold [22] • 

The Poisson-Vlasov System 

Our final example of reduction is also a Hamiltonian field theory, this time for the 
Poisson-Vlasov equations of plasma physics. For simplicity, we consider a one- 
dimensional plasma (the methods are trivially generalized to three dimensions). 
Once again, we label particles by their initial conditions. This time, however, 
the flow is in phase space, so the initial conditions are tq and po, and the present 
phase space position is r and p. The dynamical fields are thus r{rQ,pQ,t) and 



V2p = -V- (v- Vv), 



(2.142) 



dp 
dn 



fi ■ (v ■ Vv). 



(2.143) 
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p{rQ,po,t). We shall use z to refer to the set of coordinates, r and p, and zq to 
refer to the set of initial conditions, tq and po. The fields may thus be abbreviated 
z{zo,t). 

The Lagrangian for this system that includes the electrostatic potential en- 
ergy of interaction was first written down by Low fl^ . It is 

2 ■ 2 



dzofizo) 



-r\zo)-'- I dz'J{z'Mr{zo).r{z'o)) 



(2.144) 



Here we have ignored species labels for simplicity. Also, f{zo) is the distribution 
of initial conditions on phase space, and g{r, r') is the Coulomb potential kernel. 
The canonical momentum field is then 

SL 

^(^o) = jrj-^ = f{zo)mr{zo). (2.145) 

The Hamiltonian is obtained by Legendre transformation 

H^J ^^0^0^ + ^JdzoJ dz'of{zo)f{z'o)g{r{zo),r{z'o)). (2.146) 

The bracket is canonical, with r and tt canonically conjugate. 

Now suppose that the particles are identical. Just as with Euler's fluid equa- 
tions, it turns out that we can reduce to an Eulerian description. This time, the 
Eulerian field variable is the usual distribution function on phase space, f{Z). 
This may be expressed in terms of the Lagrangian field variables as follows: 

f(i?,P,t) = I dzof{zo)d{R-r{zo.t))d(^P-^^^^y (2.147) 



This is the analog of Eqs. (|2. 11911 and ^TH 

Now we can take the canonical bracket of f{Z) with f{Z'). We get 

{f{Z)J{Z')} = J dZ"f{Z"){S{Z - Z"),S{Z' - Z")}o, (2.148) 

where {a, 6}q denotes the single-particle Poisson bracket of a{R", P") with 
b{R" , P"). Note that we have been able to express the bracket of the Eulerian 
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field variables in terms of the canonical field variables; thus we have achieved the 
desired reduction. The bracket of any two functionals of f is found by application 
of the Leibniz rule. The result is 

RB} = /</Zf(Z){^,^}^. (2.149) 

This form for the bracket was first given by Iwinski and Turski , by Morrison 
who credits it to Kaufman j2Hl, and by Gibbons fSHj- A derivation similiar to 
that above can be found in a paper by Kaufman and Dewar |3Uj . 

Finally, we see that the Hamiltonian can be expressed in terms of f as follows 

H = J dZf{Z)^ + ^ j dZ j dZ't{Z)f{Z')g{R,R'). (2.150) 

It is now readily verified that the above brackets and Hamiltonian yield the 
Poisson-Vlasov equations of motion, 

df P df d(p df ^ r^.r.^ 
ot m oR oR oP 



where 



(j)[R) = e j dZ'f{Z')g{R,R') (2.152) 



is the electrostatic potential. 

Note the similarity in structure of the brackets for all three of the above 
examples. For example, all three have a Poisson tensor that is linear in the 
coordinates used. All are examples of what are called Lie-Poisson brackets, and 
there is a rich mathematical literature on brackets of this sort (see, for example, 
Marsden 

2.3.4 Singular Poisson Structures 

There are a few very important observations to be made about the above ex- 
amples before we go on to talk about perturbation theory. First consider the 
free rigid body problem. Note that the matrix in Eq. fj2.121|) is singular with 
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rank two for m 7^ 0, and rank zero for m = 0. Indeed, any odd dimensional 
phase space must have a singular Poisson structure, because antisymmetric ma- 
trices always have even rank. For these systems, Darboux's theorem does not 
apply and it is not possible to find a transformation to canonical coordinates; 
of course, this should have been obvious because canonical coordinates always 
come in pairs and you can't pair an odd number of things. 

When a system has a singular Poisson structure, the Poisson tensor will have 
at least one null eigenvector. Let's say it has n of them; note that n is equal to 
the dimensionality of the phase space, N, minus the rank of the Poisson tensor, 
r. In this case, it has been shown by Littlejohn jHI] that it is always possible 
to find a set of n = — r scalar phase functions whose gradients are those 
null eigenvectors. This is not at all obvious and requires an application of the 
Frobenius theorem of differential geometry, where use is made of the fact that 
the Poisson tensor satisfies the Jacobi identity. 

These n scalar phase functions are very special in that their bracket with any 
other scalar phase function must vanish. This is obvious from Eq. fj2.1U2|) . Scalar 
phase functions with this property are called Casimir functions. In particular, 
their bracket with any Hamiltonian is zero, so they are always conserved quan- 
tities; note that their conservation follows directly from the bracket structure, 
independent of the particular Hamiltonian under consideration. 

For the free rigid body problem presented above, the null eigenvector of the 
Poisson tensor is any multiple of m itself. The function: 



is then a Casimir function since its gradient is in the direction of m, and we 
recognize it as the total angular momentum squared. Of course, any other 
scalar phase function that is functionally dependent upon C could have been 
used equally well. The pathology at the point m = where the rank of J 



C(m) = m\ + m\ + m\ 
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changes is called a symplectic bone, and is discussed at length by Weinstein [22! • 
If we were to choose C to be one of our generalized coordinates, say the third 
coordinate in place of 7713, then it is clear that the third row and column of J 
would be zero. The two by two submatrix consisting of rows and columns one 
and two would be nonsingular, and Darboux's theorem could be applied to that 
subsystem. Thus, the correct generalization of Darboux's theorem for singular 
Poisson structures is to say that it is always possible to find a transformation to 
a coordinate system for which the matrix of components of the Poisson tensor 
has an r by r submatrix in canonical form with the rest of the entries vanishing. 
For the free rigid body problem Poisson structure given above, this has the form: 

^ 10^ 

-1 (2.154) 
Y y 

for m 7^ 0, and J = for m = 0. 

It is worth repeating that Casimir functions are conserved for any Hamilto- 
nian. For example, the Hamiltonian: 

H{m) = urns, (2.155) 

where /U is a constant, together with the same bracket used above for the free 
rigid body problem, yields the equations of motion for a classical spin gyrating 
in a uniform magnetic field. That is, mi and 1712 undergo simple harmonic oscil- 
lations, while ms is conserved because it commutes with the Hamiltonian. Note 
that C is a conserved quantity for this system as well, because the bracket is the 
same. In general, the Poisson structure is considered to be a more fundamental 
entity than the Hamiltonian. 

The other two examples presented in the last subsection also have singu- 
lar Poisson structures. It is readily verified that the bracket for Euler's fiuid 
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equations has the Casimir functional 

C = j d^C^^{C, t) ■ [v X v(e, t)] (2.156) 

(the integrand here is called the helicity), and that the bracket for the Poisson- 
Vlasov equations has the Casimir functionals 



C$ = y dZ^{f{Z)) (2.157) 
where $ is an arbitrary function of its argument. 

2.3.5 Phase-Space Lagrangian Techniques 

In this section, we review the phase space Lagrangian formalism; for more details 
on this subject see Littlejohn jHH] and Littlejohn and Gary jSl]. For a system 
with canonical coordinates, q and p, and time-independent Hamiltonian, H{q, p), 
the phase space Lagrangian is given by 

L{q:P,q,p) =p - q - H{q,p), (2.158) 

where a dot denotes differentiation with respect to time, t. Note that L may 
depend upon all the phase space coordinates and their time derivatives, unlike 
ordinary configuration space Lagrangians, L{q, q). The associated action is 

A = J dtLiq,p,q,p), (2.159) 

the variation of which yields the Euler-Lagrange equations 

dt \dq J dq \ dq J dq ^ 

and 

d fdL\ dL f dH\ dH ^ -.^ 



dt \dp J dp \ dp J dp ^ 

these are recognized as the canonical equations of motion. 
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We denote by z^, where ^ = 1, . . . , N, (where N = 21) the coordinates of 
phase space. The phase space Lagrangian may then be written 

L{z,z)=-i^z^ -H{z), (2.162) 

where the covector whose components are 7^ will be called the action one-form. 
For the canonical coordinate system used above, these components are 

T.^!"" ''^=''-'' (2.163) 
[0 if ;U = / + !,..., iV. 

The fact that / of these components are zero is a manifestation of the fact that 
the coordinate system is canonical. For more general coordinate systems this 
will not be true, as we shall see shortly. Note that phase space Lagrangians 
are always linear in i. Also note that knowledge of the action one- form and 
the Hamiltonian is completely equivalent to knowledge of the phase space La- 
grangian by Eq. ()2.162|] . 

The equations of motion may be written in this notation as follows: 

^ d f dL\ dL 



or 



where we have defined the Lagrangian two-form 



dt V9z> / ^z^' 

dt ^"'^ ^ dz^^ 
= (7m,. - i'^ + ^ (2.164) 



dH 

u,^z^ = ^, (2.165) 



UJ 



- (2.166) 



or 



(2.167) 
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For z = {q,p), where q and p are canonically conjugate, it is easily verified that 
Eq. is equivalent to Eqs. fl2J6n|l and (1061^ . 

We can recover the more familiar Hamiltonian formalism in the following 
manner: Assuming that [ujp,^] is a nonsingular matrix, we denote its inverse by 
J^^, so 

J^^^p, = 5^ (2.168) 

Then Eq. ()2.165p becomes 

^ jt^^ (2.169) 

These are recognized as Hamilton's equations if we identify J^" as the Poisson 
tensor. That the Poisson tensor is antisymmetric and obeys the Jacobi identity 
is easily verified. In particular, the Jacobi identity follows directly from duj = 
dd-f = 0. 

Under a (possibly noncanonical) transformation of phase space coordinates, 
z ^ the action one-form transforms in the usual fashion of a covariant vector 
to give 

r„ = l^l,. (2.170) 

Similarly, the Lagrangian two-form transforms like a second rank covariant ten- 
sor 

dz^ dz^ 

^fj-'' = 'q^'q^^Cv =^'^,f^ ~^f^,^7 (2.171) 

where the commas in Eq. (|2.17ip denote partial differentiation with respect to 
Z. The Hamiltonian, of course, transforms as a scalar, K{Z) = H{z). The new 
equation of motion is then 



dK 

^^^^^ = (2-172) 



which may be compared to Eq. ()2.165|1 . 

Note that all of the above considerations assume a time-independent Hamil- 
tonian. This restriction is not important for two reasons: First, we could always 
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work in extended phase space to treat a time-dependent system; this is the ap- 
proach taken by Littlejohn and Gary jHlj. Second, aU of our relativistic equations 
of motion wiU have the single-particle proper time as the independent variable, 
and nothing depends explicitly on this. 
The transformation 

dS 

7e^7e + ^, (2-173) 

where S is an arbitrary scalar field on extended phase space, is called a La- 
grangian gauge transformation. Though it alters the action one-form, it is easily 
seen to have no effect on the Lagrangian two-form, and so it does not change 
the equation of motion, Eq. fj2.165p . 

It is clear that if L is independent of one of the extended phase space coordi- 
nates, say z^, then the associated canonical momentum, dL/dz^, is conserved by 
Noether's theorem. Note, however, that a gauge transformation, like Eq. fj2.173p . 
using a scalar field, S, that depends upon the ignorable coordinate, could de- 
stroy the Noether symmetry, even though the associated momentum would still 
be conserved. The same is true for coordinate transformations like Eq. fj2.17Up . 
Conversely, we see that it may be necessary to perform gauge or coordinate 
transformations in order to uncover Noether symmetries and, hence, to discover 
conserved quantities. 

The strategy for our treatment of the guiding- center problem will be to start 
with the phase space Lagrangian for a single relativistic charged particle in an 
electromagnetic field, and, via a sequence of gauge and coordinate transforma- 
tions, find a representation in which the gyroangle, 6, is ignorable. This is the 
Noether symmetry for the gyromomentum. When this is achieved, the gyroangle 
will no longer appear in the equations of motion for the other variables, and the 
magnetic moment will appear only as a constant parameter like the rest mass. 
Thus, in this system of "gyrocoordinates," the rapid oscillatory motion is effec- 
tively decoupled from the slower guiding-center motion, and the dimensionality 
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of our phase space is reduced by two. 
2.3.6 Constrained Systems 

Eqs. ()2 .16211 and fj2.159|l may be interpreted as follows: The variation of the 
action one form must vanish, subject to the constraint that the Hamiltonian is 
constant. By including other constraints, besides the fact that the Hamiltonian 
is constant, we can discover new and interesting Poisson structures that have 
those other constraints "built in." 

For example, consider a particle that is constrained to move on the surface 
of a sphere of radius r. To model this system, we take the canonical action one 
form, 

7 = p • (ir = pxdx + Pydy + Pzdz, (2.174) 

and vary it subject to the constraints that the Hamiltonian, H, be constant, 
that the particle position be on the sphere 

\r\^ = + + z"^ = r^, (2.175) 

and that the particle momentum be tangent to the sphere 

r-p = xpx + ypy + zp:, = 0. (2.176) 

The constrained variation may be done in any one of a number of ways; e.g. 
by use of Lagrange multipliers. Thus we write 

L = p-r- ^Ailrp-Aar-p-if, (2.177) 

and form the Euler-Lagrange equations 

P = -Air - A2P - (2.178) 
dH 

O^r-X^r - —. (2.179) 
dp 
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Dot the first of tfiese equations witfi r to get 

r ■ 

dr 



dH 

r-p = -Ai|r|2-r-— , (2.180) 



from which it follows that 



1 f dH\ 
^i = -;:2r-(p+^)- (2.181) 



Then dot the second with r to get 

dH 



= r-r-A2|rP-r- (2.182) 

op 







[ X2 j 


1 r2 ( 



from which it follows that 

1 f dH\ 

Note that Eqs. ()2.18H1 and (12.1 83|1 may be written in the form 

■({p,r} + l) -r-{p,p} \ f dH/dr 

r-{r, r} — r-({r, p} — 1) / \ dH/dp 

(2.184) 

To get the Poisson brackets, first substitute the Lagrange multipliers, (|2.181|) 
and (|2.18;^ll . back into the equations of motion, (|2. 17811 and (|2. 17911 . We get 

rr\ / rr\ dH 

i-;i)--=(i-;i)-a^ P^sb) 

and 

1 ' (l ^ ( ' ^2 186) 

Note that these two equations do not determine the motion completely; they 
give only the projection of the motion on the sphere. To fully determine r and 
p, we need to employ the derivatives of the constraints. 



r ■ r = 



(2.187) 
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and 



Using these, we finally get 



and 



r-p + r-p = 0. (2.188) 



rr\ dH 

1 ") ■ 



rr\ dH 1 , , dH 



These equations of motion are Hamiltonian with the quadratic Poisson structure 

{r\H} = 



{r\pj} = 5 



r Tj 

J J.2 



= (2.191) 

Note that the constraints, Eqs. fj2.175|l and fj2.176|) . are Casimir functions of this 
Poisson structure. This means that the Hamiltonian equations of motion will 
yield dynamics that respect these constraints for any Hamiltonian whatsoever. 

There is another approach to deriving the above set of brackets. We could 
have adopted the spherical coordinates, 

r = Va;2 + 2/2+^2 (2.192) 

e = arctan(v'a;2 + 1/2/^) (2.193) 
4> = arctan(y/x), (2.194) 
on 3?^. These have the canonically conjugate momenta 



Pr = {xpx + yPy + ZPz)l V^x2 + |/2 _|_ ^2 (2.195) 

PQ = z{xpx + ypy)/ a/x2 + y2 (2.196) 
P4, = xpy-ypx, (2.197) 
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as is easily verified. The advantage to using these spherical coordinates is that 
the constraint surface in phase space is simply described by setting pr equal to 
zero, and r equal to a constant. 

Now we can write 

r = xyi + yy + zz 

= cos + sin ^ cos + sin ^ sin (/)y (2.198) 

and 

= {pr cos9 — — sin 9)z 
r 

+ {pr sin 9 cos (p + — cos 9 cos (f) — — esc 9 sin (^)x 

+( Pr sin 9 sin (p + — cos 6' sin (/> + — csc^cos^)y. (2.199) 
r r 

Eqs. ()2.198j) and fl2.199|l and the Leibniz rule allow us to compute the brackets 
for the system of coordinates (r, p) in terms of the brackets for the system of 
coordinates (r, 6*, ^, Pr, Pe,P(j))- If we ignore the constraint, then the latter system 
is canonical, and it follows that the former system is also canonical. If, on the 
other hand, we incorporate the constraint by dictating that r and pr are Casimir 
functions and that Pr = 0, then the brackets (j2.191|) follow immediately. 

It is interesting to contrast these two methods for obtaining the brackets 
(j2.19ip . We shall use these methods when we cast our guiding-center equations 
of motion in gyrogauge and boostgauge invariant format, towards the end of 
the Chapter 3. Our guiding-center Poisson brackets will also have a quadratic 
Poisson structure, similar to that of the above set of brackets. Such quadratic 
Poisson structures seem to arise naturally from this type of manipulation. The 
reader who is interested in pursuing this topic further is encouraged to read 
about Dirac's theory of constraints j35j . 
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2.4 Lie Transform Perturbation Theory 

2.4.1 General Discussion of Lie Transforms 

Recall that we first introduced coordinates on manifolds using the concepts of 
charts and atlases. A chart is a one-to-one map from a region of 3?"^ to a region 
of an n- dimensional manifold. Each coordinate, z", may thus be thought of 
as a function on the manifold. When we change coordinates, we are effectively 
transforming these functions. 

Consider an infinitesimal transformation of coordinates given by 

Z'^ = z'^ + h{Cgz)°' = z"" + hg''{z), (2.200) 

where h is an infinitesimal, g is a vector field, and the Lie derivative acts on the 
coordinates as though they were scalar functions. From our geometrical inter- 
pretation of the Lie derivative, we see that we are effectively taking the functions 
that define the coordinates, and sliding them an infinitesimal parameter interval, 
h, along the field lines of g. The inverse transformation is 

z"" = Z'^ -h{CgZ)'^ = Z'^ -hg'^iZ). (2.201) 

Of course, since h is an infinitesimal, we are scrupulously ignoring anything of 
order h"^. 

Now we ask how basis vector components behave under the above transfor- 
mation. Assume a coordinate basis for simplicity. We have 

d dz^ d d , dq^ d 



dZ^dzP dz"^ dz'^dzP' 
Similarly, basis covector components transform as follows: 

dZ'^ = -m^dz^ = dz"^ + h^dz^. (2.203) 
OZP ozP 

Now suppose that t is some tensor field on the manifold. We can ask how 

the components of t behave under the above transformation. Use a prime to 
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distinguish the components of t in the new coordinate system. We demand 

t'pl::2 (^) 1^ ® ■ ■ • ® ®dz^' dz^^ 



dz 

d d 
(g) ■ ■ ■ (g) — 



^'tlZuli^)!^ ® ■ ■ ■ ® ® dz"' ® ■ ■ ■ ® dz"^ .{2.204) 



Now expand in h, retaining only first order terms. We find 

^ VX...VS ''u^...Vs "'Vvx...Vs,ay 

+tZ-,:X9'',., + ■■■ + t^.l:::li:.,c.9'',J- (2.205) 

Suppose that we define a new tensor field, T, whose components in the old 
system are the same as those of t in the new system. Then, by comparison with 
Eq. ()2.43p . we may write 

T = t- h{Cgt), (2.206) 

where comparison with Eq. ()2.43p is helpful. Furthermore, since this last equa- 
tion is in coordinate-free form, it is true for coordinate bases and noncoordinate 
bases alike. 

Compare the signs of the second terms on the right-hand sides of Eqs. ()2.206p 
and fj2.200|l . Despite the algebra that went into proving the above result, it has 
a marvelously simple geometric interpretation. If we slide the values of the 
coordinates one way along a field line of g, then we must slide the tensor field in 
the other direction. In case this is not obvious, a trivial example is afforded by 
a scalar field on 3?, call it f{x). If we transform coordinates to X = x + h, then 
F{X) = f'{X) = fix) = fix -h) = fix) - hidf/dX)iX) = if- hCf)iX). 

Suppose that our tensor field is the tensor product of two tensor fields, say 
t = ti ® t2. Then, since Lie derivatives obey the Leibniz rule over the tensor 
product, we have 

T = t - hiCgt) 
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= ti ® t2 - /i£g(ti ® ta) 

= ti ® t2 - /i(/:gti) ® t2 - Ml o (^gts) 

= [ti - h{Cgti)] [t2 - h{Cgt2)], (2.207) 

where, as always, we neglect 0{h'^). This result indicates that the infinitesimal 
transformation commutes with the tensor product. 

Next suppose that the tensor field is obtained by starting with a tensor of 
higher rank and applying to it some number of vectors and/or covectors. For 
example, say t = s(a, U) where a is a covector field and U is a vector field; 
we could have let s have more than one of each type of argument or other 
unfilled slots without affecting the following reasoning in any way. Apply the 
transformation, and use Eq. fl2.44j] to write 

T = t - h{Cgt) 

= s(a,U)-/i£3[s(a,U)] 

= s(a, U) - h{Cgs){ai, U) - hs{Cga, U) - /is(a, CgV) 

= {s- hCgs){a- hCga.V - hCgV). (2.208) 

This result indicates that the transformation commutes with the application of 
the vectors and/or covectors. 

Next suppose that the tensor field is an exact form. That is, say t = dfl. 
Since Lie derivatives commute with exterior derivatives, it follows that the trans- 
formation commutes with the application of the exterior derivative. 

The above results indicate that any tensorial relationship, including those 
with differential operators, retains its form under a transformation of the form 
given in Eq. ()2.200|1 . This crucial point makes the Lie transform method possible. 

Now suppose that we wish to consider finite (rather than infinitesimal) 
changes of coordinates. That is, suppose we wish to slide the coordinate values 
a finite parameter interval, e, along the field lines of g. The easiest approach is to 
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divide the finite interval into a large number of infinitesimal intervals by writing 

Z = lim (1 + 47^9)^^ = exp{eCg)z. (2.209) 
The finite transformation of the tensor, t, is then 

^ = J™ ^1 - ^^^^'^^ = ^M-e^g)^- (2.210) 

A/ — >oo iV 

The transformation given by the above equations is called a Lie transform gen- 
erated by the vector field, g. 

Because the infinitesimal transformations of the form given in Eq. ()2.2()()j) 
are known to preserve tensorial relationships, and because a Lie transform is 
composed of nothing more than a large number of these infinitesimal transfor- 
mations, it follows that Lie transforms preserve tensorial relationships. That 
is 

exp{~eCg){ti ® ta) = {exp{-eCg)ti) ® {exp{-eCg)t2), (2.211) 

and 

exp(-e£g)[s(a,U)] = [exp(-e/:g)s] (exp(-e£g)a, exp(-e/:g)U), (2.212) 

and 

exp{-eCg){dn) = d{exp{-eCg)n). (2.213) 

We now have a way of making finite coordinate transformations of any tensorial 
equation that is guaranteed to preserve its tensorial form. 

By Taylor expanding the exponential in Eq. fl2.2()9j1 and using Eq. ()2.3()j1 for 
the Lie derivative, it is possible to develop the transformation to arbitrarily high 
order in e. In practice, we want to be able to control the transformation order 
by order in e. There are two ways to do this. The first, due to Deprit [HEl, is to 
order the generator, g, in e. The second, due to Dragt and Finn is to make 
a succession of transformations like Eq. ()2.209p . as follows: 

Z = expieCg^ ) exp(e2£g2 ) exp{€^Cg^ )---z. (2.214) 
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In this work, we adopt the second procedure, as it was shown by Gary to 
involve fewer terms in the perturbation series at each order. Expanding the 
above equation in e and using Eq. fj2.3Up . we get 

Z = z + eCiz + e\C2 + l^^z + €^{£3 + C1C2 + l^Dz + ■■■, (2.215) 

Z D 

Here we have used Cn to abbreviate Cg^ . The inverse transformation is then 

z = ■■■exp{-e^C3)exp{-e^C2)exp{-eCi)Z. (2.216) 

Developing this order by order, we get 

z = Z- eC,Z - e\C2 - ^Cl)Z - e^{Cs - + ^Cl)Z (2.217) 

The transformation of the tensor t is then 

T = ■ ■ ■exp(-e^£3) exp{-e'^C2) exp(-e£i)t. (2.218) 

Let us suppose that t is given as a power series in the expansion parameter, e, 
so 

t = to + eti+eH2+eHs + ---. (2.219) 

Then Eq. ^TIT^ yields 

T = To + eTi + e^T2 + e^Ts + ■■■, (2.220) 



where 



To = to, (2.221) 

Ti=ti-/:ito, (2.222) 

T2 = t2 - C2to - Citi + ^Cjto, (2.223) 

T3 = ts - Csto - C2ti + C2Cito - Cit2 + Iclti - Iclto, (2.224) 

2 D 



etc. 
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Given any equation written in tensor form, we can now make near-identity 
coordinate transformations to perform perturbation analyses. That is, if the 
equation has the form of a solvable equation plus a small perturbation, we can 
make a Lie transform to coordinates for which the perturbation is removed 
or at least simplified. The form of the generator, g, required to achieve this 
simplification depends on the specific problem, and is chosen order by order in 
the perturbation series. 

Once this process has been carried out to first order, we could continue 
on to second and higher order, or we could regard the first-order problem as a 
new solvable problem and renormalize the perturbation series accordingly before 
proceeding to higher order. The latter strategy is called the superconvergent Lie 
transform procedure; superconvergent perturbation series were first investigated 
by Kolmogorov jHS! • AH this will be made clear by selected examples in the next 
few subsections. 

2.4.2 Lie Transforming a Scalar Field 

Consider the scalar equation 

f{x) = ex^ + 2x - 2c = 0, (2.225) 

where c is a constant and e is our expansion parameter. Let's pretend for a 
moment that we do not know how to solve a quadratic equation. The scalar 
field, /, is ordered in e as follows: 

fo{x) = 2x-2c, (2.226) 

fi{x) = x^ (2.227) 

and fn{x) = for n > 2. 

We wish to perform a Lie transform to a new coordinate, X, for which the 
transformed scalar will be denoted by F. Since we are working in 3?, the gener- 
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ator, g, has only one component. At order zero, use Eq. 1)2.22111 . 

Fo = fo. (2.228) 

At order one, use Eq. ()2.222ll . 

Fi = h-gifl, = x^-2gi. (2.229) 
Thus, we see that we can make -Fi vanish by choosing gi = x'^/2. Moving on to 



second order, we use Eq. (12.223^ . 

3 

F2 = -2g2 - y . (2.230) 

So we can make F2 vanish by choosing g2 = —x^/4. Thus, to order e^, we have 
the Lie transformed scalar equation 

F{X) = 2X - 2c = 0. (2.231) 

This has solution, X = c. Now x is given in terms of X by Eq. ()2.217p which 
becomes 

X = X -tgi-t^{g2-]^gig'i) 

2 2 

= c-^c^ + i-c^-.... (2.232) 

This matches the Taylor expansion of the exact solution to the quadratic equa- 
tion 

x = - (-1 + VI + 2ec) , (2.233) 
e 

to C(e^), as is easily verified. 

Note that there is another solution to the quadratic equation 

X = -{-!- Vl + 2ec) , (2.234) 
e 
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of leading order that our technique does not give. This is because it is not 
continuously connected to the solution of the unperturbed problem as e goes to 
zero. Lie transforms are useful only for near-identity coordinate transformations. 

2.4.3 Lie Transforming a Vector Field 

Now consider the following dynamical system: 

X = y 

y = -x-ex'^. (2.235) 

If we use z to denote {x, y), then this may be written 

z = vo + evi, (2.236) 

where we have defined the vectors, vq = (y, —x) and vi = (0, — x^). We now try 
to Lie transform to new coordinates, Z = {X, Y), in an attempt to get rid of the 
order e term. The transformed vector field is V = Vq + eVi, where Vq = vq, 
and Vi is given from Eq. ()2.222p . 

Vi = vi - £gVo. (2.237) 

Using the formula for the Lie derivative of a vector, the demand that Vi = is 
seen to be equivalent to the following pair of equations: 

['d^-'^d-y)'' = 

and 

These may be solved by the method of characteristics to yield 

9l = lix' + 2y') 

and 
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gf = -^xy. (2.239) 

Note that the characteristic equations for this system are the unperturbed equa- 
tions of motion. This "integration along unperturbed orbits" is a generic feature 
of problems of this sort. 

Now then, the new coordinates are given in terms of the old by 

X = x + ^{x^ + 2y^) 

Y = y-jxy. (2.240) 

The inverse transformation is then 

X = X - -{X^ + 2Y^) 
3 

y = ^+f^^- (2-241) 

Note that we are ignoring terms of order or higher. Now the equations of 
motion for Z are 

X ^ Y 

Y = -X. (2.242) 

These have solution 

X = Xq cos t + Yo sin t 

Y = Yocost - Xosint. (2.243) 

Thus, the solution for z{t) is given by Eqs. (|2.24H1 and (|2.243jl . If desired, the 
initial conditions for Z can be expressed in terms of the initial conditions for z 
using Eq. ^TM^ . 

Frequently, in physical applications of this formalism, it happens that the new 
coordinates have physical significance. For example, in guiding-center theory, we 
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shall find a Lie transform that takes us from the phase space coordinates of a 
particle to those of a guiding center. In such a circumstance, very little is gained 
by expressing the initial conditions of the transformed problem in terms of those 
of the original problem. Instead, the new coordinates acquire their own physical 
significance, and we can speak of "the equations of motion of a guiding center" 
and "the initial conditions of a guiding center," and forget all about the original 
single-particle coordinates. 

For a less trivial example of the vector Lie transform technique, see Ap- 
pendix 10 where the method is used to calculate the gyrofrequency shift for 
two-dimensional nonrelativistic guiding-center motion in a spatially nonuniform 
electromagnetic field. 

2.4.4 Canonical Lie Transforms of a Hamiltonian System 

When using perturbation theory to study a Hamiltonian dynamical system, the 
above technique of Lie transforming the dynamical vector field could be used, 
but there is a serious problem with this approach: There is no guarantee that the 
Lie transform of a Hamiltonian vector field will be another Hamiltonian vector 
field. 

Recall that a Hamiltonian vector field is given by contracting the Poisson 
tensor with the gradient of a scalar function. This suggests the following solution 
to the above problem: Instead of Lie transforming the Hamiltonian vector field. 
Lie transform the Poisson tensor and Hamiltonian separately. This will insure 
that the transformed equations of motion are still in Hamiltonian form. 

Let us examine a little more closely why this should work. Hamiltonian 
equations of motion are given by Eq. ()2.111|1 . If we write 

Z = exp(e£)z, 
J' = exp(— e£)J, 
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and 

H' = exp{-eC)H, (2.244) 

then since our equations of motion are in tensor form, we are guaranteed that 
the new equations of motion wiU be 

a Tji 

Z = J'— . (2.245) 

Furthermore, we are guaranteed that J' is antisymmetric and obeys the Jacobi 
identity because these requirements can also be written as tensorial equations 
(see Eqs. fj2.1U3p and ()2.1U4p . respectively). Thus, Eq. ()2.245p qualifies as a bona 
fide Hamiltonian system. 

We can now prove a marvelous theorem that considerably simplifies the 
work involved in making canonical (bracket-preserving) Lie transformations of 
a Hamiltonian system, and is probably responsible for the popularity of the Lie 
transform technique: A Poisson tensor is a Lie-dragged tensor along any vec- 
tor field that is Hamiltonian with respect to it. Suppose the Poisson tensor is 
denoted by J. Let V be given by 

= r^—, (2.246) 
ozP 

for some (any) scalar field, W. Then the theorem states 

Cvi = 0. (2.247) 

This is easily proved using the formula for the Lie derivative of a second rank 
contravariant tensor. We write 

= _(j«5j/37^ + r^r^^^ + j^«j^"_^)vr,^ 

-J"^Vr,5^(J«^ + J^^), (2.248) 
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where we have used Eq. f|2.24(i|l for V. The first term vanishes by the Jacobi 
identity, the second term vanishes by antisymmetry, and the theorem is proved. 

It immediately follows that a Lie transform along the vector field V leaves 
J unchanged. This is because a Lie transform is the exponentiation of a Lie 
derivative (set £^t„ = in Eqs. (|2.221|1 through (|2.224|1 to recover T = t). 
Thus, Lie transforms generated by Hamiltonian vector fields are always canoni- 
cal. Now Hamiltonian vector fields are in one-to-one correspondence with scalar 
phase functions, W, by Eq. ()2.246|1 . so we have found a way to generate canonical 
transformations with scalars. 

Thus, to perform a canonical Lie transform of a Hamiltonian system, we 
need only to transform the Hamiltonian. Now the Lie derivative of a scalar with 
respect to a Hamiltonian vector field is given by 

CvH = = J"^Vr,/3iy,a = -{W, H}. (2.249) 

Thus, for a canonical Lie transform of a Hamiltonian, we may rewrite 
Eqs. (12.221 j) through (|2.224j) as follows: 

Ko = Ho, (2.250) 

K^ = H^ + {WiHo}, (2.251) 
K2 = H2 + {W2, Ho} + {Wi,Hi} + ^{Wi, {Wi, Ho}} (2.252) 

Ks = Hs + {Ws,Ho} + {W2,H^} + {W2,{Wi,Ho}} + {Wi,H2} 

+ l{Wu{Wi, Hi}} + ^{Wu {Wi, {VTi, Ho}}}, (2.253) 

etc. Here we have denoted the new Hamiltonian by K. 

To see how this is used, consider the following example: We perturb a har- 
monic oscillator Hamiltonian by the addition of a nonlinear term, 

H=liq'+p')-lp^. (2.254) 
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Note that the unperturbed motion osciUates with unit frequency. We can intro- 
duce action-angle variables for the unperturbed Hamiltonian, 

J=liq''+P'') (2.255) 

e = arctan(g/p), (2.256) 



so that 



Thus we have 



and 



H = J - ^J^ [1 + ^ cos(2^) + ^ cos(4^) ) . (2.257) 



Ho = J (2.258) 



1/4 1 
Hi^ — jM 1 + - cos(20) + - cos(40) ) . (2.259) 
2 \ 3 3 



We now try to remove Hi by a canonical Lie transform generated by the scalar, 
Wi (we shall work only to order one in e). We have Kg = Hq, and 

Ki = Hi + {Wi, Ho} = Hi + {Wi,J} = Hi + -^. (2.260) 

Note that we cannot demand that Ki = since that would cause Wi to be 
multivalued (that is, secular terms would appear in Wi). The best that we can 
hope for is to make Ki equal to the ^-average of Hi. That is, 

Ki = --J2. (2.261) 
2 



Then 

' j2[4cos(26')+cos(46')], (2.262) 



de 6 

and this integrates to give 

1 



Wi = [8 sin(2^) + sin(4^)] . (2.263) 
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Using this generator we can work out the transformation equations, and hence 
completely solve the problem (to order e). For now we note that the perturbed 
frequency is given by 

dK 

n = — = l-eJ. (2.264) 
oJ 

Note how the Lie transform has taken us to a new set of coordinates in which 
the perturbation is averaged; that is, independent of the angle variable. Since 
the resulting Hamiltonian depends only on the action variable, it is integrable 
by definition. Furthermore, secular terms were avoided by this absorbing of the 
averaged part of the perturbation into the new Hamiltonian. 

Aforementioned problems of resonant perturbations occur when the unper- 
turbed motion has characteristic frequencies that vary with the action (this is 
true generically, but not in our above example). When this happens, dWi/dO 
can equal a quantity that is oscillatory but whose frequency passes through zero 
on some set of measure zero in phase space. Thus, in some neighborhood of this 
region, problems of secular behavior can develop. Various techniques exist for 
dealing with this problem, but we shall not consider such problematic regions of 
phase space in this thesis. 

2.4.5 Noncanonical Lie Transforms of a Hamiltonian Sys- 
tem 

It sometimes happens that a canonical transformation is not the best way to 
solve a particular problem in perturbation theory. This may be because it is 
best to express the unperturbed problem in noncanonical coordinates for which 
the perturbation alters not only the Hamiltonian but also the Poisson struc- 
ture. This is the case for both the guiding-center and oscillation- center problems 
whose solution forms the core of this thesis. In this case, we must resort to non- 
canonical transformations, but we demand that they preserve the Hamiltonian 
nature of the equations of motion. As has already been pointed out, this can 
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be accomplished by Lie transforming the Poisson tensor along with the Hamil- 
tonian; this means that the vector generator of the Lie transform should not be 
a Hamiltonian vector field. 

Consider once again the harmonic oscillator Hamiltonian, 

H=^iq^+p^). (2.265) 

This time, we introduce a perturbation not in the Hamiltonian but rather in the 
Poisson structure. Suppose that the perturbed brackets are 

{q,p} = 1- ep'^. (2.266) 

Thus we have J = Jq + eJi, where Jq is the canonical Poisson tensor. We wish 
to perform a Lie transform that will restore the bracket to its canonical form. 
We demand 

= j; = Ji - CgJo- (2.267) 

Straightforward computation shows that this imposes only one independent re- 
quirement on the generating vector field, g, namely 

T + (2.268) 

oq op 

It is easy enough to solve this equation; for example, we could take 

3 

9' = Y (2-269) 

and 

g'^ = 0. (2.270) 

This effectively restores the bracket to canonical form, but it alters the Hamil- 
tonian as follows: 

K = H- tCgH = \{q^+ P^) - (2.271) 
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Note that this transformed problem is coincidentaUy the same one that we 
treated in the last subsection. Thus, we could now apply a second (this time 
canonical) Lie transform to finally solve it. Once again, we would find the per- 
turbed frequency, O = 1 — e J. 

The important thing to note here is that g is not a Hamiltonian vector field. 
If it were, there would have to exist a scalar function W such that = dW/dp 
and /3 = —dW/dq. Examination of the mixed second derivatives shows these 
to be incompatible requirements. 

2.4.6 Lie Transforming the Phase-Space Lagrangian 

There is another way to go about making noncanonical transformations of a 
Hamiltonian system that is guaranteed to keep it Hamiltonian. Recall that 
specifying the action one form is equivalent to specifying the Poisson tensor 
(assuming that everything is nonsingular). We can simply take the exterior 
derivative of 7 to get w, and then invert to to get J. These are all tensorial 
relationships, so we could just as well Lie transform 7 and H instead of J and 
H. 

Indeed, there are several advantages to this approach. First, it is easier to 
take Lie derivatives of one forms than of second rank contravariant tensors; there 
is one less term to worry about, and, more importantly, we can use the homotopy 
formula to help us Lie differentiate one forms. Second, when we Lie transform 
the Poisson tensor, we are guaranteed that the resulting tensor will be a valid 
Poisson structure only to the order we are keeping. When we Lie transform the 
action one form on the other hand, its exterior derivative is still going to be 
closed even if we truncate it. Thus to is exactly closed, so J = uj~^ will obey the 
Jacobi identity exactly. 

Consider a Lie transformation of the original action one form, 7, into a new 
action one form, F. Using the homotopy formula, Eqs. fj2.221|) through ()2.224p 
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become 



ro = 70, 



(2.272) 



Fi = 7i - iiwo + dSi, 



(2.273) 



ii{uJi + Oi) + ^5*2, 



(2.274) 



ra = 73 - ^s^^o - ^2^^! - n[^2 - i^dii{uJi + -Oi)] + dSs, 



(2.275) 



etc. Here, we have defined a;,^ = d'fn, and fl^ = dT^. Note that in these 
equations, we have also made near-identity gauge transformations by adding 
dSn at order n for all n > 1. In fact, any other one- forms in these equations that 
were given by the exterior derivative of a scalar (typically arising from the second 
term on the right of Eqs. fj2.75|l and fj2.76|l ). were absorbed in the definitions of 
the Sn- 

Thus, these last transformation equations are capable of dealing with any 
near-identity coordinate or gauge transformations, and so it is these that we 
shall use in the sections to follow. The vectors Qn and the scalars Sn will be 
determined by certain desiderata: We want the transformation to average away 
the rapidly oscillating terms of the Hamiltonian and action one-form, and we 
want to avoid secular terms. For the guiding-center problem, we shall also want 
the action one-form to be invariant with respect to certain transformations called 
gyrogauge and boostgauge transformations. This will be explained in more detail 
later. 

For now, we consider another simple example. Consider once again the har- 
monic oscillator Hamiltonian, and perturb the canonical action one form as 
follows: 



7 = pdq + -p dq. 




We have 
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This inverts to give (1 + ep^)~^ times the canonical Poisson tensor, and to order 
e this is the same as the perturbation that was examined in the last subsection 
(which is why we chose it). We can now compare the two methods of doing the 
problem. 

Demand that Ti = 0, so Eq. (|2.273|) gives 

= Ti = 71 - iiuo + dSi = {^p^ - gP)dq - g'^dp + dSi. (2.278) 
Thus we can take Si = 0, and 




(2.279) 



and 

g'^ = 0. (2.280) 

These are precisely the same generators that we discovered in the last subsection, 
they have precisely the same effect on the Hamiltonian, and the rest of the prob- 
lem follows in identical fashion. That is, a second canonical Lie transformation 
is necessary to get to averaged coordinates. 



Chapter 3 

Relativistic Guiding-Center 
Theory 

3.1 Discussion 

Relativistic guiding-center motion occurs in many applications of plasma physics, 
including controlled fusion, free-electron lasers, and astrophysics. The tandem 
mirror and bumpy torus plasma confinement devices, for example, utilize pop- 
ulations of magnetized electrons at relativistic energies in complicated field-line 
geometries. In free-electron lasers, relativistic electron beams travel along strong 
magnetic fields with superposed wiggler fields. Near a neutron star, relativistic 
plasma can be confined in strong electromagnetic and gravitational fields. 

All these examples point out the need for a formalism that is able to treat gen- 
eral electromagnetic field geometries. Particle simulation codes used for studying 
the properties of guiding-center plasmas in controlled fusion confinement devices 
sometimes require the guiding-center equations of motion to one order higher 
than the usual drifts; this indicates the need for a simplified and systematic 
perturbative treatment, such as that afforded by the use of Lie transforms. The 
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free-electron laser problem has no obvious preferred frame of reference, and this 
suggests that a manifestly covariant description would best reveal the essence of 
the physical processes involved. The neutron star problem involves coupling to 
a general relativistic gravitational field, and this absolutely requires a manifestly 
covariant formulation. All these desiderata will be satisfied by our theory. 

Nonrelativistic theories of guiding-center motion in arbitrary magnetic ge- 
ometry frequently make use of orthonormal triads of unit vectors at each point 
of three-dimensional physical space. One member of each such triad is required 
to lie in the direction of the magnetic field at that point. Such a basis affords 
great clarity and relative ease in the computation and exposition of the results 
of guiding-center theory. 

One of the first problems to be addressed in any relativistic formulation of 
guiding-center theory is thus that of finding the relativistic analogs of these basis 
triads. Fortunately, this problem has been solved by Fradkin ^^l, who gives a 
straightforward method for finding orthonormal tetrads of unit vectors at each 
point of four-dimensional spacetime. In a frame for which the perpendicular 
electric field vanishes, one pair of unit vectors in these tetrads lies perpendicular 
to the magnetic field, while the other pair spans the two-dimensional subspace 
determined by the direction of the magnetic field and the direction of time. 

Fradkin shows that these two two-dimensional subspaces are covariantly de- 
fined, and that the rapid gyration takes place in the first of these, while the slower 
parallel motion takes place in the second. This formalism is therefore useful for 
isolating the oscillatory motion so that it can be effectively averaged to obtain 
the guiding-center equations of motion. It is described from first principles in 
Sections EH EH and HOI 

Lie transform perturbation theory is used to perform the averaging. Though 
this technique has been known for some time jlHl: its use for the guiding-center 
problem poses special difficulties which were first overcome by Littlejohn |22j . 
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The difficulties are due to tfie fact tfiat tfie Poisson structure as well as the 
Hamiltonian depends upon the rapidly gyrating variables, so that the transfor- 
mation required to gyroaverage the system of equations is not canonical. 

A Lie transform in its most general sense is a coordinate transformation 
generated by a vector field on phase space. If this vector field generator is a 
Hamiltonian vector field (that is, a vector field that is the flow generated by some 
scalar Hamiltonian-like function) then the transformation it induces is canonical; 
in this case one often simply speaks of the transformation as being generated 
by the corresponding scalar function. For the guiding-center problem, however, 
the vector generator of the averaging transformation cannot be a Hamiltonian 
vector field, since it must generate a noncanonical transformation. 

In the nonrelativistic guiding-center problem, it was found by Littlejohn [2j 
to be easiest to apply the general Lie transform to the action one form. This 
is the approach that is followed here; it was described from first principles in 
Chapter El 

In any calculation that goes beyond the lowest order drifts, it was found 
by Littlejohn j3T] to be necessary to worry about maintaining a certain gauge 
invariance property of the action one form which for the nonrelativistic case is 
known as gyrogauge invariance. If the averaging transformation does not preserve 
this invariance property, then the final guiding-center equations of motion will 
depend unavoidably on the arbitrarily chosen basis vectors used to set up the 
problem, as was noted by Hagan and Frieman [321 ■ In Section 13.61 we work out 
the relativistic generalization of this invariance property, and we find that the 
relativistic case admits another similar gauge invariance property which we call 
boostgauge invariance. 

The Lie transforms are carried out in Sections 13. 7^ 13.81 and 13. 9| and the 
guiding-center Lagrangian and Hamiltonian are presented. The Poisson bracket 
structure is then given in Section I3.1UI and the equations of motion are pre- 
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sented and discussed in Section 13.111 In Section \'A.12\ a complete summary of 
the transformation equations is given for reference and the correction to the 
gyromomentum is derived. In Section f3. 131 we show how to write our results in 
"1 + 3" notation, and we compare our results to those of Northrop [321 ■ In Sec- 
tion l3.l41 we cast all our results in manifestly gjrogauge and boostgauge invariant 
format. 



3.2 Conventions and Notation 



In this work, we adopt the following conventions: The particle space-time co- 
ordinate will be denoted by r'^, where fi = 0, . . . , 3. The Minkowski metric, 
9^iu = diag(— 1, +1, +1, +1), is used throughout our derivation of the guiding- 
center equations, but the results will be written in manifestly covariant form so 
that this assumption can be relaxed. The four potential is given by A^^ = [(f), A), 
so the antisymmetric field tensor is -F = dA, or 



A 



A 
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(3.281) 



(3.282) 



where e^i^cep is the completely antisymmetric fourth rank Levi-Civita tensor with 
^0123 = +1- Note carefully that e^^^s _ _-|^^ thanks to the Minkowski metric. 
It is often convenient to use "1-1-3" notation. Then, the matrix of components 
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of the mixed field tensor, F^^^, may be written 

(3.283) 

and that of the mixed dual field tensor, JF'^j^, may be written 

( -B \ 

= . (3.284) 





Note that we have used the notation (1 x B)^-, = eikiSjkBi = eijiBi. Also note 
that the mixed field tensors are neither symmetric nor antisymmetric. The 
advantage to dealing with the mixed tensors is that one may contract them with 
other tensors using ordinary matrix multiplication. Of course, we could equally 
well do this with the completely covariant or contravariant forms, but we would 
have to remember to use the Minkowski metric when multiplying a row by a 
column. 

Thus, when the field tensor is applied to an arbitrary four- vector, the result 
may be written 

F- f = f ^'^ I . (3.285) 

\ay \aE + axB 

The analogous equation for the dual field tensor is 

^. I I = [ ^ I . (3.286) 

y a y y -aB + a X E 

This "1 + 3" notation will prove to be useful and convenient throughout the 
remainder of this thesis. 

The two familiar Lorentz scalars can be expressed in terms of these tensors 

by 

Ai = ^F^^F'"' = ^F:F = B^-E'', (3.287) 

and 
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A2 = = : F = E ■ B. (3.288) 

Note carefully that F : F = F^^F^""" = -F^^F^^^ = -Tr(F ■ F). 
The Lorentz equation of motion may then be written 

du e , ^ , ^ 

m— = -F(r) ■ u, 3.289) 
dr c 



where 



dv 

u = — (3.290) 
dr 



is the four-velocity, r is the proper time, m is the rest mass and e is the charge. 

Equation ()3.289|1 makes it clear that if the field is independent of space- 
time position, then the frequencies of the motion are the eigenvalues of F times 
—ie/mc. Now the characteristic equation for the matrix F is 

det(F - Al) = + AiA^ - A^ = 0. (3.291) 

This biquadratic in A is easily solved to give A = iA^;, or A = izA^, where we 
have defined the Lorentz scalars 



Xe = sgn(A2)^/^(VA? + 4Ai-Ai), (3.292) 



and 



Xb = \/^(\/A?+4Ai + Ai). (3.293) 



We can write Ai and A2 in terms of A^ and As as follows: 

Al = A|-A|, (3.294) 

and 

A2 = XbXe. (3.295) 
We can now define the two Lorentz scalars 

Qe = (3.296) 

mc 
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and 



Qb = (3.297) 

mc 



The first of these is the inverse of the characteristic proper time required to 
accelerate to relativistic velocities along field lines, while the second is the gy- 
rofrequency with respect to proper time. 

3.3 The Electromagnetic Projection Operators 

In this section, we summarize the work of Fradkin ^Hj that is relevent to this 
study. It is straightforward to verify the following identities: 

- J^'^ = -Xil, (3.298) 

and 

F ■ = ■ F = -X2I. (3.299) 
Premultiplying the first of these by F, and employing the second gives 

F^ = -AaJ^ - AiF. (3.300) 

Premultiplying by F once again gives 

F^ + XiF^ - Xjl = 0. (3.301) 

Comparing this with Eq. fj3.291|l . we see that we have proven that F obeys its 
own characteristic equation, as it must by the Hamilton- Cay ley theorem. Now 
it is clear that Eq. ()3. 30111 may be written as follows: 

(F - AeI) ■ (F + XeI) ■ {F - zAfil) ■ (F + zA^l) = 0, (3.302) 

and the four factors in this expression commute, so any of them could have been 
written first. Thus, if \1/ is an arbitrary column four-vector, then 



{F-Xe1) 



(F + XeI) ■ (F - iXel) ■ (F + zAsl) ■ ^' 



0, (3.303) 
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so that (F + Ae1) ■ (P — ^Asl) ■ (F + zAsl) is an (unnormalized) eigenvector of 
F with eigenvalue Xe- Thus, the operator {F + XeI) ■ {F — iXsl) ■ {F + iXsl) is 
a (unnormahzed) projection operator that projects arbitrary four-vectors onto 
the vector subspace spanned by the zeroth eigenvector of F. Proceeding in this 
manner, it is easy to see that the projection operator 



+ A^ 



projects arbitrary four-vectors onto the vector subspace spanned by the eigen- 
vectors of F with eigenvalues ±As, while the projection operator 



= ' lit (3.305) 



projects arbitrary four-vectors onto the vector subspace spanned by the eigen- 
vectors of F with eigenvalues izAs. The normalization constants were chosen to 
make the projection operators idempotent; that is 

P||-P|| =P||, (3.306) 

P±-P± = P±: (3.307) 

P\\ ■P± = P±- P\\ = 0, (3.308) 

and 

P|l +P± = 1. (3.309) 

We have thus decomposed the tangent space at each point of space-time into 
the Cartesian product of two two-dimensional "two-flats." The rapid gyromotion 
takes place in the perpendicular two-flat since it is spanned by the eigenvectors 
corresponding to the imaginary eigenvalues, while the parallel motion takes place 
in the parallel two-flat since it is spanned by the eigenvectors corresponding to 
the real eigenvalues. These two-flats will play an indispensible role in our theory. 
We shall use them to isolate the gyrational components of the particle velocity 
in preparation for the guiding-center Lie transform. 
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In Section l3.7| we shall order the fields in an expansion parameter and, for 
reasons that will be explained at that time, we shall demand that our lowest- 
order field have A^; = 0. Furthermore, the two-fiats that we shall use will always 
be defined in terms of the zero-order field; that is, the field tensor that appears 
on the right hand side of Eqs. ()3.3U4p and ()3.3U5p is always the lowest-order field 
tensor with Xe = 0. Thus, these equations can be simplified to read 

P\\ = ^ + YT = TT (3-310) 

and 

P^ = - =X- (3.311) 

^B 

In "1 3" notation, Eqs. (|3.31()|1 and (|3.311|1 become 

1 / S2 -E X B 

and 

1 / E X B 



(3.313) 



S2-£;M -E X B -BB-EE + S^]^ 
Henceforth, all our results concerning the nature of the two-fiats and the unit 
vectors that span them will contain this assumption that the underlying field 
tensor has Ae = 0. 



3.4 The Orthonormal Basis Tetrad 

We wish to show how to construct a tetrad of unit vectors such that one pair 
spans the parallel two-fiat while the other pair spans the perpendicular two-fiat. 
Clearly such a tetrad is not unique; it is defined only to within an arbitrary rota- 
tion in the perpendicular two-fiat, and an arbitrary hyperbolic rotation (boost) 
in the parallel two-flat. We shall have much more to say about this nonunique- 
ness later; for now we are simply looking for a way to construct any such tetrad. 
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From the arguments presented in the last section, we know that one way to 
do this is to examine the eigenvectors of the field tensor. Here we shall take a 
different approach that is perhaps more physically motivated. Recall that we 
are dealing with fields for which i^y = (if this is true in any one frame, it will 
be true in all frames because E ■ B is a Lorentz scalar). There exist a set of 
local "preferred" reference frames for which Ej^ also vanishes; hence there is no 
electric field at all in these preferred frames. Thus, in a preferred frame, the 
field tensors may be written in "1 + 3" notation as follows: 

(3.314) 

\ 1 X B / 

and 

I " " I . (3.315) 





Also, in a preferred frame, the projection operators have the form 

P\\=\ ^ ° (3.316) 




and 




where 



(3.317) 



b = B/|B|. (3.318) 



The above forms for the projection operators in a preferred frame make it 
clear that we can choose the following orthonormal basis tetrad for a preferred 
frame: 









: 




: 









eo = h ei = , (3.319) 
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and 

62 = 1^ j , es = 1^ j , (3.320) 

where ti and T2 are unit three- vectors perpendicular to b, such that {b, ti, r2} 
constitutes an orthonormal triad in three-dimensional space. We reiterate that 
the above choice is not unique. 

Of course, we would like to be able to construct an orthonormal basis tetrad 
in an arbitrary Lorentz frame. To see how to do this, we consider a Lorentz 
boost from the above-described preferred frame to a new frame. The Lorentz 
transformation matrix for a boost is 

A = I ^ I , (3.321) 

1 + (7 - I)f3-^f3(3 J 

where the three- vector f3 is the generator of the Lorentz boost (it is the relative 
velocity of the two reference frames divided by c), and where 7 = (1 — /3^)~^/^. 
This matrix is an element of the Lorentz group because it satisfies = g-A^-g 
(here we have used a superscripted "T" to denote the transpose operation) . See 
Jackson [33] for more details on the Lorentz group and its generators. 
The new field tensor components are then 

f 7/3 X B \ 

F' = A-F-A-i = . (3.322) 

\^ 7/3 X B 1 X [7B - (7 - l)/3-2/3/3 ■ B] J 
In writing this result, we have made use of the vector identity, 

/3 X B/3-/3/3 X B = 1 X (/^^B -/3/3- B). (3.323) 

From this result for the field tensor, we see that we can identify the electric and 
magnetic fields in the new frame as 



E' = 7/3 X B 



(3.324) 
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and 



B' = 7B - (7 - l)/3-^l3(3 ■ B. 



(3.325) 



At this point, there are a number of interesting observations to be made. 
First note that if f3 is paraUel to B then E' = 0, so the transformation takes us 
to another preferred frame. Next note that if (3 is perpendicular to B then B 
is paraUel to B'. Next note that it is possible to arrive at any desired E' by a 
transformation with f3 perpendicular to B. Specifically, if we take 



then it is easy to see that the new electric field is E'. Conversely, if we begin 
with a frame in which the (perpendicular) electric field is E', then a Lorentz 
boost with (3 = (3^ gets us to a preferred frame. 
The orthonormal tetrad in the new frame is then 



(3.326) 



where 



(^E 



E' X B' 



(3.327) 




(3.328) 



and 





7^/3^ ■ T2 

T2 + (7E - l)f3E^l3El3E ■ 




(3.329) 



where 7^ = (1-/31)-^/^. 
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At this point we note that we can choose ti to he along the direction of f3^ 
without any loss of generality. We can now write the results for the unit tetrad 
in the general frame, dropping the primes which are no longer needed because 
all quantities will refer to the general frame. Thus 





eo = h ei = , (3.330) 



and 





62 = . h es = . , (3.331 



where 



^ E X B , , 

f^E ^ (3.332) 



and 7E = (1 — /3%)~^^'^- Here we have also introduced the notation for a unit 
vector in the direction of f3^ if f3^ ^ 0. If = 0, one may choose 0^ to be 
any unit three- vector perpendicular to b. 

Using Eqs. ()3.285|1 and ()3.286p . the following useful identities are readily 
demonstrated: 

F ■ eo = 0, F ■ ei = 0, (3.333) 
F-e2 = -ABe3, F-e3 = +ABe2, (3.334) 

and 

jr.eo = -ABei, J'-ei = -ABeo, (3.335) 
J^-e2 = 0, J^-e3 = 0. (3.336) 

Thus, the field tensor and its dual have the effect of rotating these unit vectors 
within their respective two-flats. 

Using Eqs. ()3. 31011 and fj3.311|l . it is easy to verify that Py leaves eo and ei 
unchanged and annihilates 62 and 63, while P± annihilates eo and ei and leaves 
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62 and 63 unchanged. It is also easy to verify that this tetrad is orthonormal 
with respect to the Minkowski metric; that is, that 

■ = g^„. (3.337) 

So eo and ei span the paraUel two-flat, and 62 and 63 span the perpendicular 
two-flat, as asserted. The geometrical situation is illustrated schematically in 
Fig.O 

In terms of the e^, the projection operators may be written 

P|[ = -eoeo + eiGi (3.338) 

and 

-Pi = 6262 + 6363. (3.339) 

This should be clear from the geometrical picture, but may also be verifled by 
direct algebra. 

When applied to the particle four- velocity, these projection operators will 
allow us to isolate the rapid gyrational motion in the perpendicular two-flat 
from the nongyrational motion in the parallel two flat. Thus 

u = u^e^, (3.340) 

or, if we introduce polar coordinates {w, 9) for the perpendicular four-velocity 
components and hyperbolic polar coordinates (/c, (3) for the parallel velocity com- 
ponents, then we may write 

u = eok cosh f3 + ei/c sinh (3 — sin 9 — e^w cos 9 (3.341) 

or 

u = ki + wc, (3.342) 

where we have deflned 

t = eo cosh /9 -I- ei sinh /?, (3.343) 

and 




Figure 3.1: The Orthonormal Basis Tetrad 
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If we also define 



and 



c = —62 sin 6' — 63 cos 6'. (3.344) 
b = Go sinh /3 + ei cosh (3.345) 
a = 62 cos 6' — 63 sin 6*, (3.346) 



then (t, b, c, a) form a new velocity-dependent basis tetrad that is also orthonor- 
mal with respect to the Minkowski metric. Please do not confuse the basis four- 
vector b with the basis three-vector b, and do not confuse the hyperbolic polar 
coordinate P with the Lorentz transformation generator f3. 

Some useful relations among the elements of this new basis tetrad are 

i=b, i=t, (3.347) 



dc _ dk _ p. 



(3.348) 



and 



F ■ t = 0, F ■ b = 0, (3.349) 

F-c = -Aija, F-a = +Asc, (3.350) 

and 

T-i = -\Bh, T-h = -\Bt, (3.351) 

J^-c = 0, J^-a = 0. (3.352) 
Also, the projection operators may now be written 

P|l = -tt + bb (3.353) 

and 

P^ = cc + aa. (3.354) 

It is useful to compare the above description of the four-velocity in terms of 
(/c, (3, w, 9) with the more conventional "1 + 3" representation, u = 0(7^, 7^/3„), 
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where = v/c. We shall do this using the unit tetrad that we constructed 
above. Combining Eqs. iKM^ . (fOTH) and iKMl} . we find 

C'jy = 'jEik cosh (3 — (3ew sin 9) (3.355) 

and 

C'jyfS^ = 7£;(/3^A; cosh (3 — /3^w sin 9) — b x f3^w cos 9 + hk sinh/3. (3.356) 
From these equations, it follows that 

k sink 3 

Pvi = /3.-b= ; — . (3.357 

'-fE[k cosh p — pEWsm9) 

n _ a a PEk cosh (3 -w sin 9 ro okq\ 

Pv2 = f3^-f3E = — — 3.358) 

k cosh p — Pew sm 9 

/3„3 = /3.-(bx/3^)= 7"^; — - (3.359) 

7£;(A; cosh p — Pew sm y) 

and 



k = c7E7„^/l-/32i-2/3i5/3„2 + /3|/32i+/3|/322 (3.360) 

-1 / Pvl 



(3 = tanh-^ I, n n A (3.361) 
\7i?(l -/3s/3„2) 



w = c^Elv^(3l2 + K^-WE(3.2 + (3l-(3l(3l^ (3.362) 
6* = arg(-/3„3 -z7i7;(/3„2 -/3e)) (3.363) 

Note that the four coordinates (/c, /3, ly, 6*) obey the constraint k^ —w"^ = c^, and 
this is why they can be determined by the three components of /3^. Naturally, 
the above transformation equations depend upon the choice we made for the 
unit tetrad. This arbitrariness will be discussed further in Section 13.61 These 
transformation equations will be most useful when we want to compare our 
results to those of other authors who have used "1 + 3" notation; this will be 
done in Section IB. 131 
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3.5 Phase Space Lagrangian for a Charged Par- 
ticle in an Electromagnetic Field 



For a relativistic charged particle in an electromagnetic field, one possible choice 
for the Hamiltonian, H, in canonical coordinates, {q,p), is given by [331 

H{q,p) = ^(p--A{q)y , (3.364) 
2m V c / 

and the action one form for canonical coordinates is, by Eq. ()2.163|) 

7 = p ■ dq. (3.365) 

Note that the independent variable is the particle's proper time; the equations 
of motion are thus of the form of Eq. (|2.165|1 , but the dot in that equation now 
denotes differentiation with respect to proper time. 

We begin by making a noncanonical transformation to the new coordinates 
(r, w), where 

r = q 

(3.366) 

Thus we have eliminated the unphysical canonical momentum, p, in favor of the 
particle velocity, u. The new Hamiltonian is 

777 

H'{r,u) = -u^ (3.367) 

and the new action one form is 

7' = (mu + ^^(^)) ■ dr. (3.368) 

If we now use Eq. (|3.342jl to eliminate the four components of u in favor of 
(/c, /3, w, 9), then the new Hamiltonian is 

777 

H"{r,k,(3,w,e) = —{-k^ +w^) (3.369) 
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and the new action one form is 

y = (^mki + mwc + -A(r) j ■ dr. (3.370) 

It is important to remember that t and b are functions of r and and c and 
a are functions of r and 9. Thus, the second term in the parenthesis on the 
right hand side of Eq. ()3.370|1 is rapidly oscillating due to its dependence on 9 
(this will be made more precise shortly). We are now ready to apply the Lie 
transform procedure that will effectively average H" and 7" by transforming to 
gyrocoordinates in which 9 is ignorable. 

3.6 Gyrogauge and Boostgauge Transformations 

We now discuss the afore-mentioned arbitrariness in choosing the orthonormal 
unit vectors, e^. A boostgauge transformation replaces our choices for Bq and ei 
as follows: 

Bq = eo cosh$(r) - ei sinh$(r), (3.371) 
e'^ = ei cosh$(r) - eo sinh$(r), (3.372) 

while a gyrogauge transformation replaces our choices for 62 and 63 as follows: 

§2 = e2Cos^'(r) +e3sin^'(r), (3.373) 
63 = 63 cos ^'(r) - 62 sin ^'(r). (3.374) 

Note that the new unit vectors are still orthonormal, that gq and ei still span 
the parallel two-flat, and that 62 and 63 still span the perpendicular two-flat. 
The gyrogauge and boostgauge transformations have simply given each of these 
two pairs of unit vectors a rotation within its respective two-flat. The amount of 
rotation is measured by $ in the parallel two-flat, and by \1/ in the perpendicular 
two-flat. Note that these can be functions of the particle's spacetime position, 
r. 
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Recall that we used the unit tetrad to decompose the particle velocity into 
parallel and perpendicular parts, and to coordinatize these by (/c, (3) and {w, 9), 
respectively. It is fairly easy to see that the transformation given by Eqs. ()3.37ip 
through (|3 .37411 will have no effect on k and w, but will shift /3 and 9. Hence, 
we add 

/3' = /3 + $(r) (3.375) 
to our boostgauge transformation equations, and 

9' = 9 + ^{r) (3.376) 

to our gyrogauge transformation equatons. None of the other phase space coor- 
dinates are affected by the transformations. 

Equations ()3.371|1 through ()3.376|1 constitute the full gyrogauge and boost- 
gauge transformation equations. A quantity that is left unchanged by these 
transformation equations will be said to be gyrogauge or boostgauge invariant, 
respectively. The concept of gyrogauge invariance has a nonrelativistic analog 
which was first discussed by Littlejohn jH] . In the remainder of this section, we 
shall extend his methods to our relativistic problem. 

To begin with, we note that the unit vectors (t, b, c, a) are all gyrogauge and 
boostgauge invariant. This is demonstrated for t as follows: 

t' = eg cosh /?' + e[ sinh /?' 

= (eo cosh $ - ei sinh $) cosh(/3 + $) + (ei cosh $ - Bq sinh $) sinh(/3 + $) 
= eo[cosh $ cosh(/3 + $) - sinh $ sinh(/3 -|- $)] 

+ei [- sinh $ cosh(/3 + ^) + cosh $ sinh(/3 + $)] 
= eo cosh /3 + ei sinh /3 

= t; (3.377) 
the demonstration for the other three unit vectors follows similarly. Because the 
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parallel and perpendicular projection operators may be written in the form of 
Eqs. ()3 .35311 and ()3.354|1 . their gyrogauge and boostgauge invariance is manifest. 

The fact that the quantities above are gyrogauge and boostgauge invariant 
means that they may be expressed in terms of purely physical tensor quantities; 
more precisely, they may be expressed in terms of quantities that are completely 
independent of our choice of the orientation of the basis tetrad, e^, at each point 
in spacetime. For example, P|| and Pi can be expressed in terms of the field 
tensor, as was done in Eqs. ()3.304jl and fj3.305|l . The gyrogauge and boostgauge 
invariant quantities k and w can be written in terms of the projection operators 
and the particle four- velocity with the help of Eq. (|3.342|1 



/c = ■ w, (3.378) 

and 



w 



^/u-P±- u. (3.379) 



Finally, the members of the tetrad (t, b, c, a) can all be expressed in terms of the 
field tensor and the particle four- velocity, with the help of Eqs. fj3.342p . fj3.350|) . 
and (pTHHTI 

t = ip|| ■ u, (3.380) 

b = --^J^-t, (3.381) 
As 

c = -Pi ■ M, (3.382) 

w 

a = --^P-c. (3.383) 
As 

Now consider the pair of one-forms: 

Q = (Vei) ■ eo = -(Veo) ■ ei = (Vb) ■ t = -(Vt) ■ b, (3.384) 



and 

TZ = (Ve2) ■ eg = -(Ves) ■ 63 = (Vc) ■ a = -(Va) ■ c. 



(3.385) 
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where V is a shorthand for the spacetime gradient. It is a straightforward ex- 
ercise to show that Q is not boostgauge invariant, and that 71 is not gyrogauge 
invariant; this is essentiaUy because the spacetime derivatives are taken at con- 
stant /3 and 9, and these latter two quantities are obviously not boostgauge and 
gyrogauge invariant, respectively. First note that V transforms under a general 
boostgauge and gyrogauge transformation as follows: 

V = V-(V$)^-(V^)^, (3.386) 

where we have made use of Eqs. fj3.375|l and ()3.376p . Thus we have 

Q' = (V b') • t' = [Vb - (V$)t] ■ t = Q + V$, (3.387) 

and 

n' = (V c') ■ a' = [Vc + (V*)a] ■ a = 7^ + V*. (3.388) 

Here we have used Eqs. fl3.347|l and fl3.348|l . The one-forms Q and 71 will be use- 
ful to us momentarily. Furthermore, they have great geometrical significance as 
will become clear later when we discuss the guiding-center equations of motion. 

We now ask what it means for a general one-form in our phase space to 
be boostgauge and gyrogauge invariant. Using Eq. fj2.170|l . we find that the r 
component of the one- form transforms as follows: 

dr op de 

= 7, - (V$)7/3 - (V*)7e, (3.389) 

while all of the other components (/c, f3, w, and 9) are unchanged. Thus it is clear 
that the charged particle Hamiltonian and action one form given by Eqs. (|3.369p 
and ()3.370p are boostgauge and gyrogauge invariant, since they have no f3 or 9 
components. 

Now we demand that our Lie transformations, when applied to gauge invari- 
ant quantities, preserve their gauge invariance. This, coupled with the estab- 
lished boostgauge and gyrogauge invariance of the particle action one-form, will 
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guarantee the boostgauge and gyrogauge invariance of the guiding-center action 
one-form. Suppose that we have a boostgauge and gyrogauge invariant scalar 
field, /. Applying the Lie derivative operator, Cg, we find from Eq. fj2.30p 

CJ = g -Vf + g ^+/^+^ 9^+9 ^. (3.390) 

If we now subject this to a general boostgauge and gyrogauge transformation, 
we find 

+(<,'" -V*-9"")|^, (3.391) 

where we have made use of the assumed gauge invariance of /. Thus, Cgf will be 
gauge invariant if all the components of g are gauge invariant, with the exception 
of g^ and g^ which must transform as follows: 

g'P = / V$ ■ ^7'^ (3.392) 

and 

g" = g' + V<f-g"'. (3.393) 

Thus, if we use a subscripted "0" to denote a gauge invariant quantity, we see 
that the components of the vector g must be of the form 

g"" = {glo 

/ = (/)o 

/ = {g% + Q-iglo 

g"" = (g^o 

/ - (g^+n-ig^o (3.394) 



110 



Relativistic Guiding-Center Theory 



Using the homotopy formula, it is a straightforward exercise to show that this 
result is valid not only for gauge invariant scalars, but also for any gauge invariant 
n-form. In particular, this restriction on the form of g is necessary to guarantee 
the gauge invariance of the Lie transformed action one-form, so we shall demand 
that it hold in the sections to follow. 

3.7 The Zero-Order Problem 

We order the particle Hamiltonian and action one-form with the prescription 
e I— > e/e; equivalently, we could say that we are ordering the electromagnetic field 
at order e~^. The electromagnetic contribution to the canonical momentum thus 
dominates the kinetic contribution. This ordering procedure has been discussed 
at length by Kruskal [IS] and by Littlejohn |41j . 

We shall also order the four potential of the electromagnetic field in the 
parameter e, so 

oo 

A = ^e^Ai. (3.395) 
Clearly, this induces an ordering of the field itself 

oo 

F = ^e^F„ (3.396) 

where 

Fi = dA,. (3.397) 

Henceforth, when we refer to the Lorentz scalars (Ai, A2, A^; and A^) or to the 
unit basis tetrads or to the projection operators, it is to be understood that they 
are calculated on the basis of the zero order field tensor, Fq. 

The Hamiltonian, Eq. ()3.3()9j1 . is thus an order unity scalar. The particle 
action one-form, Eq. f)3.370jl . may be written 

1 °° 

^=7^^'^- (3.398) 

1=0 
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where 70 has the component 



70, = -Ao(r), (3.399) 
c 



71 has the component 



e 



'yi^ = -Ai{r) + mkt + mwc, (3.400) 



c 



and 7i has the component 



7,, = ^Mr) (3.401) 



for i > 2. All components not listed above are zero. 

Suppose that we now write the equations of motion to lowest order as ■ i = 
0, where uq = d'jo. This turns out to be an instructive exercise even though, 
as we shall see in a moment, it is somewhat misleading. We see that the only 
surviving component of lvq is 

LOOrr = ^Fq, (3.402) 

SO we get the following equation of motion: 

Fo ■ r = 0. (3.403) 

Now we know that t is never zero, so Fq must have at least one null eigenvector 
with nonzero time component. In particular, this must be true in a preferred 
frame, for which f3^ = 0. Thus the parallel two-flat must be the nullspace of Fq. 
So we demand that 

Xe = 0, (3.404) 

where we again emphasize that A^; is computed from Eqs. ()3.282|) . fj3.287p . 
fj3.288|) and ()3.292|] using Fq in place of F. This is a restriction on the allowed zero 
order fields. It is the relativistic analog of the usual nonrelativistic restriction 
that E\\ = to lowest order. Recall that we used this assumption in Section EH 
when we first discussed the basis tetrads. 
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Thus, when we order the four potential in e, we must keep in mind that the 
field derived from Aq should have no E\\. If we have a problem in which there is 
nonzero then it must be included in An where ti > 1. In particular, it could 
all be put into Ai. The only reason for keeping A^ where n > 2 in our theory 
is that sometimes a problem admits another expansion parameter in the field 
geometry (the stellarator expansion parameter and the long-thin parameter in 
mirrors are examples), and in some asymptotic theories that other expansion 
parameter may be taken to be equal to the guiding-center expansion parameter. 
In such cases, one might want to expand the field in a general power series in e, 
rather than just restrict oneself to the use of Aq and Ai. 

Thus, Eq. fl.H .40311 constitutes only two independent conditions on the four 
components of r. Dotting it with c and a and using Eq. ()3.35njl gives c-r = a-r = 
0, so r must lie in the parallel two-fiat; that is, the particle motion is constrained 
to lie along the field lines like that of a bead sliding along a wire. The rapid 
oscillatory motion is then considered to be a modification to this motion along 
the field lines, to be transformed away except for the residual perpendicular 
drifting motion. 

What is perhaps most disturbing about Eq. (|3.4U3|1 is that it gives only two 
dynamical equations of motion when there are really eight independent phase 
space coordinates. It gives us no description of the motion along the field lines, 
and no description of the rate of change of the velocity components. This is 
because the matrix of components of the zero order Lagrangian two-form is a 
eight by eight matrix whose rank is only two. This is thus an example of a 
problem in asymptotics with no well-defined limit problem; this phenomenon is 
by no means rare and has been discussed in a general context by Kruskal [321 • 

To get a better idea of what is going on here, we should consider the full 
particle equations of motion, retaining the lowest order nonzero contributions to 
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each component of u = d'-f, even if some are higher order than others. We find 

UJrr = ^Fo + C(l), (3.405) 

(^rk = —mt, (3.406) 

ujri3 = — m/cb, (3.407) 

ujrw = — mc, (3.408) 

Urd = +miya, (3.409) 

with all other components vanishing. Forming the equations of motion, uj ■ z = 
dH/dz, we find that 

r = /ct + wc, (3.410) 

so there is no longer any ambiguity in the parallel motion. Similarly we can now 
find the equations of motion for the velocity components. We get 

k = C(l), (3.411) 

P = 0(1), (3.412) 

w = 0{1), (3.413) 



and 



e = ^Ob + C(1). (3.414) 



This makes it clear that the dominant motion at lowest order is the gyration, 
in accordance with our intuition. Thus, as e — 0, we have the rate of change 
of 9 dominating that of all the other dynamical variables, including r. Hence, 
averages over the unperturbed motion will simply be averages over 9. 

Note that in order to get this zero order equation of motion, we needed 7^ 
only to order e~^, while all the other components of 7 were needed to order 
unity. This peculiar mixing of orders persists to higher order; so to obtain the 
n-th order guiding-center equations of motion, we will need 7^ only to order 
n — 1, while all the other components of 7 will be needed to order n. 
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3.8 The Preparatory Lie Transform 



All treatments of guiding-center motion share one feature in common: In the 
transformation from particle position, r, to guiding-center position, R, they all 
include the term, —wsl/Qb- This is the gyroradius vector, and it is the most 
intuitive term in the entire guiding-center transformation (indeed, one might 
argue that it is the only intuitive term in the entire guiding-center transforma- 
tion) . We shall make this transformation before we do anything else, as this was 
found to facilitate the remainder of the calculation in Littlejohn's nonrelativistic 
treatment [Zj- 

From Eqs. fl2.3()|l and f|2.215jl . we see that, to first order, the difference be- 
tween z and Z is simply given by the components of the generator vector, g. So 
since we want to have R = r — wsl/Qb, we see that we should choose 

w 

g; = - — a, (3.415) 

where the subscript "p" denotes "preparatory." 

Now g^ is clearly boostgauge and gyrogauge invariant, but from Eq. ()3.394p 
we see that a Lie transform generated by this vector alone would not preserve 
the gauge invariance of the action one-form. Consequently, we must append the 
following additional components to gpi 

Q, (3.416) 

7^. (3.417) 

First note that the Hamiltonian, Eq. ()3.369p , is unaffected by the preparatory 
Lie transform because it is independent of r, (3 and 9 (so CpH" = 0). Next, using 
Eqs. fj2.272|) through ()2.275p . we calculate the new action one- form resulting from 
the transformation generated by this vector. This transformation takes place at 



and 



w 



9p 



w 



n, 
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first order only, so we may set gi = Qp and g2 = gs = in those equations. Also, 
since we are interested in calculating the guiding-center equations of motion to 
third order (this turns out to be one order higher than the usual perpendicular 
drifts), we do not need T^^. 

At zero order, we have the obvious 

To = 70- (3.418) 
This has the single nonzero component. 

To, = -Ao. (3.419) 

c 

The corresponding Lagrangian two- form, wq, was given in Eq. (|3.4(J2p . 

Moving on to first order, it is readily found that ipUJo (where, in keeping with 
past convention, ip = ig^) has only one nonzero component, 

{ipUJo)r = mwc. (3.420) 

We take 5"! = 0, so Eq. fj2.273|) gives the following nonzero component for Fi: 

Ti^ = -Ai + mki. (3.421) 

c 

Note that the aforementioned rapidly oscillating term, mwc, has been removed 
from by the transformation. 

Before proceeding to second order, we need to calculate wi = d'ji and Oi = 
dVi. The first of these has the following nonzero components: 

uJi^r = -Fi + 'mk{\/i - tv) 



+mw{Vc- c\/), (3.422) 

wirfc = — mt, (3.423) 

uirp = -mkh, (3.424) 

ujirw = — mc, (3.425) 

uJirQ = +mwa. (3.426) 
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The second has the following nonzero components: 



^1 rr 
^Irk 



-Fi + mA;(Vt - tV), 

c 

— mt, 
—mkh. 



(3.427) 
(3.428) 
(3.429) 



Note that we have introduced the notation tV for the transpose of Vt. 

We are now ready to proceed to second order. First note that ^ipivi has the 
following nonzero components: 



1 w 
2fi 



-a 



B 



'-Fi + mk{Vt ■ P± - tV) 



+'mw{Vc ■ P\\ — cV) 



and 



mw 



20 



B 



(3.430) 
(3.431) 



Next note that |zpOi has the single nonzero component, 



1 



( 2 ^ 1 ) 



1 w 



-a ■ 



-Fi + m/c(Vt ■ P± - tV) 



(3.432) 



2 Qb Lc 

Now, using Eq. ()2.274|) and choosing 5*2 = 0, we can write down the nonzero 
components of 



-A2 + —a 
c Hb 



-Fi + mfc(Vt-P±-tV) 



+^(Vc-P|| -cV) 



and 



mw 



2B 



20 



B 



(3.433) 



(3.434) 



Note that has rapidly oscillating terms; these will be removed by subse- 
quent Lie transforms. Also note the appearance of the gyromomentum as the 9 
component of r2. 
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Moving on to third order, we recall that we do not need T^r- Referring to 
Eq. fl2.275|l . it is easily seen that 73 and ipUJ2 both have only an r-component, so 
we do not bother with these terms. Then ^ipdipUJi has a nonzero r-component 
which we shall not calculate, and it also has a nonzero 9 component given by 



1 



ipdipUJi)t 



mw 



a ■ S7^B 



w 

30|' 



-Fi + m/c( Vt - tV) 



■ a. 



(3.435) 



Similarly, ^ipdipQ,i has a nonzero r-component which we shall not calculate, 
and it also has a nonzero 9 component given by 



-ipdipQi)e 



w 



60| 



-Fi +mk{\/t - tV) 



LC 



a. 



(3.436) 



Taking ^3 = 0, we see that the nonzero components of r3 are F^r and 



36 



mw 
30| 



a ■ \/^B 



w 



-Fi + mk{Vt - tV) 



■ a. 



(3.437) 



Note that this has rapidly oscillating terms which will have to be removed by 
subsequent Lie transforms. This completes the preparatory transformation. 



3.9 The Averaging Lie Transforms 

We now perform the averaging Lie transformations that will take us to the 
guiding-center action one-form. These are somewhat more difficult than the 
preparatory transformation, since we do not know the generators in advance. 
For economy of notation, we reset our variables as follows: We shall henceforth 
refer to the Hamiltonian and action one-form that resulted from the preparatory 
transformation as H" and 7, respectively, and these new Lie transforms will take 
us to H'" and F. 

First consider the action one form. Once again, nothing changes at order 
zero, so 

Fo = 70, (3.438) 
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and the only nonzero component of this is 

To, = -Ao. (3.439) 

c 

The corresponding Lagrangian two- form, ujq, was given in Eq. (|3.4U2jl : its only 
nonzero component was Worr- 

At order one, we take gl = and 5'i = because we have already succeeded 
in averaging Fi^ by the preparatory transformation, and we don't want to ruin 
this. It follows that iiUjQ = 0, and so Fi =71. The only nonvanishing component 
of Fi is then 

Vi^ = -Ai + mki. (3.440) 

c 

Note that we have not yet had to specify Qi, Qi^ Qi^ or gf, since it is clear that 
these have no effect on Fi. These components of gi will be useful in the averaging 
of F2. Also note that Oi = Wi is given by Eqs. (I3.427j) through (I3.429j) . 

A word of caution is in order concerning the coordinate r. It is not altered 
in any way by the transformation. This means that after we complete the 
transformation to guiding-center coordinates, r will still be the single-particle 
proper time; it will nothe the guiding-center proper time. So g^j^dr^dr'^ = —dr"^, 
but g^^dR^dW^ 7^ —dr'^. Thus, throughout the remainder of this calculation, it 
is best to regard r as simply an orbit parameter, devoid of relevant physical 
significance. 

Now we proceed to second order. Note that Z2Wo has only an r-component, 

(^2(^0). = -92 ■ Fo- (3.441) 

c 

Next note that ^ii^i = ^iiuJi has the following nonzero component: 

{^iiui)r = ^{mkg^h + mg^i), (3.442) 

We then take S'2 = because we have already succeeded in averaging F26) by the 
preparatory transformation, and we don't want to ruin this. Equation ()2.274p 
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then gives the foUowing nonzero components for 



C ilB 



mw 



-Fi + m/c(Vt ■ P± - tV) 
c 

e 



and 



f ^(Vc ■ P\\ - cV) - -J2 ■ Fo - mkg^h - m^^^t, (3.443) 

(3.444) 



2^3 

We now proceed to third order, and once again we do not need the r- 
component of F^. Referring to Eq. fj2.275|l . it is easily seen that i^ujo has only 
an r-component, so we do not bother with this term. Then ^2^1 = «2'^i has a 
nonzero r-component which we shall not calculate; its other nonzero components 
are 

{i2UJi)k = -mgl-i, (3.445) 

and 

{i200i)p = -mkgl-h. (3.446) 

Next, iiUJ2 has a nonzero r-component which we shall not calculate; its other 
nonzero components are 

(ziu;2)^ = -TT^l (3-447) 



and 



7777/1 

(^l^2)e = +7S-^7r- (3.448) 



Next, ^iidii{u)i + ^fii) = ^iidiiuji has a nonzero r-component which we shall 
not calculate; it has no other nonzero components. From Eq. ()2.275jl we see that 
the nonzero components of are T^^ and the following: 

Tsfc = mgl-i + ^, (3.449) 
Ts^ = mkgl-h+^, (3.450) 
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mw a dSs 
91 + 



n 



and 



B 



w 



2fi| 



dw 



-Fi + mfc(Vt - tV) 

c 



(3.451) 



-a ■ Vfis - T^gi + 



3n% 



B 



de 



(3.452) 



Now we apply the Lie transform to the Hamiltonian. This is straightforward, 
and we get 

H'" = H'^' + eH^' + 0{e^), (3.453) 



where 



and 



H'^' = H" = mi-e + w^)/2, 



H'" 



mkgi — mwgi . 



(3.454) 



(3.455) 



Thus, the Hamiltonian, which emerged unscathed from the preparatory Lie 
transform, may indeed be modified by the averaging Lie transform. 

We must now choose the vector generator components, g^, g^, g^, g^, and f/^, 
and the scalar gauge transformation generator, 5*3, in order to average and maxi- 
mally simplify r2r, -^2"? ^Ski^spi ^Swi ^ind T^q. These are given by Eqs. fj3.443p . 
dSjEni), <^3M, fpr™ . fpOimi . and dnSni, respectively. We proceed by tak- 
ing the averaged parts of these equations. 



if- 

^3/3 



e , mw 

-Ai 

c 20b 



7?. - - ( a ■ Vc - c • Va 



'-g\ ■ Fq - mg'li - m/cfiff b. 



mkg\ — mwgi 



mg2 ■ t + 



mkg2 ■ b + 



dk[ 
dSs 



dp ' 



(3.456) 
(3.457) 
(3.458) 

(3.459) 
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^3w — 



mw ~Q dSs 
9f + 



n 



B 



dw ' 



40| \mc) ° 



(—] Fi + mk fvt - tV 

\mcJ V 



mw - dSs 
-9i + 



and the fluctuating parts, 

w . 



B 



d6 



= 



B 



^- + mk (yi ■ P± - tV 



— (a.Vc + c.Va).P|| 

fif^ ■ Fo - mg\l - mkg{h, 



e ~ 

„r 








mkgi — mwg"^, 



mg2 ■ t + 



= mkg2 ■ b + 
-- 



dk ' 

9^3 



mw ~a 
9i + 



d(3 







B 

mw^ 
"30| 



dw ' 

mw ~ dSs 
9i + 



n 



B 



86 



(3.460) 



(3.461) 



(3.462) 
(3.463) 
(3.464) 

(3.465) 
(3.466) 
(3.467) 



where we have demanded that the Hamiltonian and one-form components them- 
selves be purely averaged. In the above equations, an overbar denotes the aver- 
aged part of a quantity, while an overtilde denotes the fluctuating part. 

Solve Eq. (|3.467ll for 883/86 in terms of fiff . Then use Eq. (|3.463|1 to get g~'^ 
in terms of g'^. Then dot Eq. fj3.462|) with t in order to get g^. The result is 



dSs 
86 



mw^ „ ^„ k f w 
30|^-^"^"0^io" 



B 



-a ■ Fi ■ t — mkitta) 
c 



mw 



40 



B 



[{act) + (cat)]|. 



(3.468) 



where the abbreviation (act) is shorthand for a ■ Vc ■ t, etc. Now this equation 
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is easily integrated to give 



mw^ , -^^ wk 



Lc 



-c ■ Fi ■ t — ■mk{Uc) 



+ 



mw'^k 



80| 



[{ata) + (etc)] . 



(3.469) 



We can now back substitute to get the oscillatory parts of the vector generator 
components, 



9i 



f3 

9i 



9i 



9{ 



w ^ ^ „ kw , 
— t-Fi-a+— (tta) 
ab ^Ib 



w 



40 



B 



[{act) + (cat)] 



_a.F,.b--(«o) + — [(oc6) + („6)] 



— t-Fi-a+— (tta) 
Ab i'B 



kw 



40 



B 



[{act) + (cat)] 



(3.470) 
(3.471) 
(3.472) 



— 2-c ■ VOs — c ■ Fi ■ t 

O^ WA} 



\B 



k' 



B 



-{ttc) 



k 



40 



B 



[{ata) — {etc)] 



(3.473) 



and 



9l 



w 



Xb^b 



(P|| -P^) -Fi-c + ^fa-Va-c- Vc) -P, 



kw 
02" 



80| 



c ■ Vt ■ P± - P± ■ Vt ■ c 



- t- Vc-P|| -P|| ■ Vc-t 



(3.474) 

Next we consider the equations for the averaged parts of the generators, 
Eqs. ()3.456|1 through ()3.461|) . These constitute nine equations (Eq. fj3.456|l is 
really four equations) in seventeen unknowns (the nine components of F, and 
the eight components of g). Thus, we can choose eight unknowns at will. So we 
demand 

,2 



^2r 


= -A2- 




c 




= 0, 


^3f3 


= 0, 



mw 



20 



7^, 



B 



(3.475) 
(3.476) 
(3.477) 
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Ts^ = 0, (3.478) 

and 

Tse = 0. (3.479) 

Here we have retained the term involving TZ in in order to preserve boost- 
gauge and gyrogauge invariance, according to Eq. ()3.389p . Taking S3 = 0, we 
can now solve for g. We get 

9^2 = 0, (3.480) 



2 

w 



9^ = [{cat) - (act)] , (3.481) 



2 



9? = -4i^[(c«^)-(«c^)]' (3-482) 

gf = 0. (3.484) 
We can now solve for H2' using Eq. fj3.457|l to get 



mw 



(—) 
4ri| Kmc J 



(-) 

\mc/ 



^1 + Vt - tV 



(3.485) 



This completes the averaging transformation. 

Henceforth, we shall write transformed quantities as functions of the guiding- 
center variables (i?, K, B, W , 9) instead of their lower-case counterparts. Note 
that this has no mathematical significance, and is done only to emphasize the 
physical interpretation of the various quantities that emerge from the theory. We 
regard functions in the mathematicians' sense of the word: functional arguments 
are nothing more than dummy placeholders. 

We may now write out the full guiding-center Hamiltonian and action one 
form to the above-described order. We have 
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and 



+ y ( Vt - tV 



(3.486) 



- {Ao + eAi + e'A^) + mKt - ^^7^ + 0{e^] 



■dR 



(3.487) 



Note that 9 is an ignorable coordinate, so that its canonicaUy conjugate mo- 
mentum, fi = mW'^/2QB, is conserved. This can now be identified as the gy- 
romomentum, and it is useful to ehminate the coordinate W in favor of fi. The 
results will be denoted 



and 



2A 



B 



(— )Fi + ^(Vt-tV 

\mc/ 2 



—A + mKt-eiin + 0{e^) -dR + eiidQ. 
ec 



+ 0{e^) (3.488) 



(3.489) 



This is the form of the guiding-center Hamiltonian and action one form that will 
be used in subsequent sections. Note that the order e term in the Hamiltonian 
may be neglected if only the classical drifts (usual gradient, polarization and 
curvature drifts) are desired. 



3.10 The Guiding-Center Poisson Brackets 

As a first step towards writing down the guiding-center equations of motion, we 
form the guiding-center Lagrangian two-form. The nonzero components are 

^RR = -(Fo + eF')+mK(Vt-tV), (3.490) 

ec 

= -mi, (3.491) 

= -mKh, (3.492) 
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, (classical order) 
= { (3.493) 
eTZ (higher order) , 

fi^e = e, (3.494) 



where 




(classical order) 
Fi + eF2 — —fxM (higher order), 



(3.495) 



and 



Af = dn. (3.496) 



Here we have drawn a distinction between two cases, just as we did with the 
Hamiltonian. Terms of classical order are all that are necessary to retain if only 
the usual gradient, curvature and polarization drifts are desired. If one would 
like the equations of motion to one order higher than that, one must also retain 
the terms labelled higher order. This makes a difference only in and in the 
definition of F' . 

Now we can get the Poisson brackets using Eq. fj2.168|l . We do this by 
inverting the eight by eight matrix consisting of the components of O. This is a 
tedious but straightforward exercise, and the nonvanishing results are presented 
below. We have performed this matrix inversion for both the classical-order and 
the higher-order cases separately. 

{R,R} = (3.497) 

{R,K} = --^-S, (3.498) 

m 

{R,B} = (3.499) 
mK 

{0 (classical order) 

(3.500) 
e{R, R} ■ TZ (higher order) , 
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. (classical order) 

{K,e} = { (3.502) 

e{K , R} ■ TZ (higher order) , 

, (classical order) 

{B,Q} = { (3.503) 

e{i3, R} ■ TZ (higher order) , 



and 



{e,//} = e-\ (3.504) 
where we have defined the scalar 



T = 1 + (3.505) 



and the tensors 



eF" ■ Fo 

and 



S = 1 + ° (3.506) 



F" = F'+ ( Vt - t V) , (3.507) 

and where F' is given by Eq. fj3.495p . Note carefully that the bracket of R with 
R is nonzero because R is really four coordinates; thus {R, R} is a four by four 
antisymmetric matrix and, consequently, its diagonal elements vanish but the 
rest of it may be nonzero. 

Note that O and n are decoupled from the other dynamical variables at the 
classical order, but that O is not decoupled at higher order. The reason for 
this will be clarified shortly, but for now we note that this coupling is not at 
all problematic. The important point is that the set of functions of R, K and 
B form a subset of the set of all phase functions that is a closed Lie subalgebra 
under the operation of these Poisson brackets. Then, since our Hamiltonian is 
independent of 0, we can eliminate that degree of freedom and still have a valid 
Hamiltonian system for guiding centers. This is an example of the reduction of 
a Hamiltonian system, discussed in Chapter |21 
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Next note that we could have expanded aU of the above expressions in pure 
power series in e. For example, T appears in the denominators of several brackets, 
and consists of an order one term and an order e term. One might argue that, 
since our expressions are valid only to a certain power of e anyway, we ought to 
expand this in powers of e. There is, however, a compelling reason not to do this: 
The above brackets are guaranteed to obey the Jacobi identity exactly because 
they are elements of the inverse matrix of the matrix of components of the 
Lagrange tensor which obeys dfl = ddV = 0. If we were to expand the brackets in 
e, and retain e only to a certain power, then the Jacobi identity would be satisfied 
only to that power of e. Now one might counter that in an asymptotic theory of 
this nature, that is all we have a right to demand. In practice, however, guiding- 
center equations of motion are often integrated numerically, and violations of 
the Jacobi identity invalidate Liouville's theorem which guarantees phase space 
area preservation. This, in turn, can lead to an observed "fuzziness" of KAM 
tori which might cause one to draw erroneous conclusions about the presence of 
stochasticity. 

To elaborate on this last point, in studies of mirror-confined plasmas, for 
example, one might integrate the guiding-center equations numerically and pro- 
duce a "puncture plot" of the places where the trajectory of the guiding center 
intersects the midplane of the device. If such a plot exhibits stochasticity, one 
might well expect the radial transport of the plasma to be enhanced significantly 
as compared to a case for which the plot is a smooth KAM surface. Thus, in a 
study of mirror plasma radial transport, one might vary some parameter to see 
for what value this transition from regular to stochastic motion takes place. The 
decision might be made by comparing the numerically-generated puncture-plots 
for several different parameter values in some range. Yet if one uses guiding- 
center equations of motion that do not satisfy Liouville's theorem exactly, one 
runs the risk of misinterpreting "fuzziness" in plots that is due only to violations 
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of Liouville's theorem (which is, after aU, the only reason that KAM tori exist 
in the first place) as the presence of true stochasticity. 

This is why we inverted the Lagrange tensor for the classical and the higher- 
order cases separately, rather than do a single inversion for the higher-order case 
and truncate to get the classical case. As things stand, the brackets for both 
cases presented above are guaranteed to satisfy the Jacobi identity exactly. 

3.11 Guiding-Center Equations of Motion 

These brackets together with the Hamiltonian, Eq. ()3.488p . give the guiding- 
center equations of motion according to Eq. (|2.1()9|1 . First consider the equation 
for R. To the classical order, this may be written 



The first term contains the usual parallel motion and the E x B drift. The order 
e contribution consists of two parts: The first contains the relativistic analog 
of the curvature and polarization drifts (they are in F"), and the second is the 
relativistic analog of the grad-S drift; these statements will be clarified when 
we cast these results in "1 -|- 3" notation. Of course, the above apparatus is 
sufficient to get R to one order higher than this, but the expression itself is 
rather unenlightening to look at, so we shall not bother to write it down. 
The equations for K and B are then 



R = {R,R} ■ iJ.'V^B - {R,K}mK 




(3.508) 



K 



{K,R}-iiVVtB 



(3.509) 



m 



and 



B = {B,R} 



IjV^b - {B,K}mK 
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= ■ S ■ - —i-E- F" -h. (3.510) 

rriK mc 

The terms containing V^^ b contain the mirroring force, and the contribution of 
Fi contains the force due to the paraUel electric field; once again, these state- 
ments will be clarified when we cast these results in "1 + 3" notation. 

Next note that fi is exactly zero, even at the higher order; this, of course, 
was our aim all along. The higher order equation of motion for O is 

e = if]^ + e7^ ■ i? + -^Fo : 
e zAb 

The first term is the lowest-order gyromotion. The second term arises from 
the bracket structure, and corrects for the possibility that as the guiding-center 
moves in R, the perpendicular unit vectors upon which the definition of is 
based may rotate within the perpendicular two-fiat. This term arose from our 
demand of boostgauge and gyrogauge invariance, and it is the reason that the 
Poisson bracket of G with R, K, and B cannot vanish at higher order. The neces- 
sity of this has been discussed by Littlejohn and by Hagan and Frieman [32] • 
The third term on the right side of Eq. ()3. 51111 arises from the first-order piece 
of the Hamiltonian and consists of two subterms in the square brackets. The 
first of these subterms is the correction to the gyrofrequency due to Fi. To see 
this, define the total gyrofrequency due to both Fq and Fi hj Qbt = cAbt/^^c, 
where Xbt is given by Eq. (|3.293|1 . We quickly find 

^BT = —Jl{Fo + eFi):{Fo + eF,)+0{e^) 
mc V 2 

- —Jxl + eFo-.Fi + Oie^) 
mc V " 

= nB + -^(^)Fo:Fi + 0{e^). (3.512) 
2Xb \mc/ 

The second subterm of the third term on the right of Eq. ()3.511|) is the gy- 
rofrequency shift due to gradients of the perpendicular electric field. This is not 



mc/ 2 



vt - tv 



(3.51i; 
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expected to be obvious, and will be discussed further in Section 13 .13^ when we 
cast our results in "1 + 3" notation. 

The geometrical significance of the second term in Eq. ()3.511|1 is illustrated 
in Fig. 13.21 (here we temporarily revert to using lower-case r and 9) . In order 
to compare the unit tetrad at one point in spacetime, r, with that at another 
point, r + 6r, (to see how much it rotated) we need some way of transporting 
the unit vectors from one point to another. The correct way of doing this was 
elucidated by Littlejohn jUj. Since we have assumed fiat spacetime throughout 
this calculation, we can simply translate the unit vector 62 from r to r + 5r in 
the usual manner of Euclidean geometry. Of course, when we arrive at r + 6r, 
the translated unit vector, called 63, will not be the same as the unit vector 62. 
Furthermore, it need not even lie in the perpendicular two-fiat. To remedy this, 
we project it onto the perpendicular two-fiat and normalize the result to get a 
new unit vector, called 63*. The angle between 62 and 63* at the point r + 6r is 
defined to be 69. The calculation goes as follows: 



Henceforth, all quantities are evaluated at the point r + 6r so this will not be 
noted explicitly. Continuing, 



e2(r + dr) 



e2(r) 



62 (r + Sr — 6r) 

e2(r -|- 6r) — 6r ■ V^2{r + Sr) + 
-6rdr : VVe2(r + dr) + ■ ■ ■ . 



(3.513) 




6r ■ Ve2 ■ 6363 + -SrSr : ■ 6363 




(3.514) 
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Thus 



cos 56' = 1 — + 

■~- ^ + 



1 / ^ \2 

I - -{^5r- Ve2 ■ 63 j + 



l-l{n-drf + ■■■, (3.515) 



so we identify 



69 = n- Sr. (3.516) 



This is the change in 9 due to the rotation of the unit vectors alone, and it 
explains the second term on the right of Eq. fj3.511|l . A similar term, Q ■ R, 
would appear in the equation of motion of B if we went to higher order. 

It was noted by Littlejohn j^T] that the one-form, TZ is the potential for the 
gauge field Af = dTZ which obeys the field equation dJV = ddlZ = 0. In the 
relativistic problem, we also have the gauge field A4 = dQ, and this also obeys 
dAi = ddQ = 0. These are the gauge fields corresponding to the boostgauge 
and gyrogauge gauge groups. Note that AA and Af are gauge invariant even 
though Q and TZ are not. Thus, they can be expressed in terms of the field 
tensor directly; in index notation 

M^. = -Ljr^^p^P^^p^^^^^^^ (3.517) 

and 

A/;. = -^F-^Pf^,P±io.,.. (3.518) 

The R term of the guiding-center Lagrangian, Eq. ()3.489|1 . thus couples the two 
gauge potentials, A and 71, and the coupling constant is the gyromomentum. 
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Figure 3.2: Change in Gyroangle due to Rotation of Basis Tetrad as Guiding 
Center Moves in Spacetime 
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3.12 Summary of Guiding-Center Transforma- 
tion 

The entire transformation that we have made from the particle coordinates may 
be written in the form of Eq. fj2.214|) as follows: 



Z = exp(e£gp) exp{eCg^) ex.p{e^ Cg^) exp{e^Cg^) ■ ■ ■ z. (3.519) 



It is possible to expand these equations in e, and plug in our expressions for 
the generators to get the coordinate transformation equations. For reference, we 
present these here: 



and 



R 



K 



B 



W 



e 



= r 



ewa. 9 ( w 



a- V 



+ + 



a 



2fi| 



Xb^b 

2 



(P|| -P^)■F^■c 



2kw , , - r/ N / M - 

■(ttc)t + {{ata) - (etc)] t 



kw 
kw 



\{Uc) + {thc)\ b 



w 

8^1" 



\{aha) - {chc)\ b 



[(cto) - (ate)] a| + O(e^), 



+ e 



kw 



B 
W 



(tta) 



w 



20 



B 



{act) 



+ t ■ Fi ■ a 



/3 + e 



w w 



w 



w ^ e 
+ 



kXs 

Ob 
A; 



b ■ Fi ■ a 



{tta) 



kw 



40 



B 



[3 (act) - (cat)] 



Ab 2Ab 



a ■ Fi ■ c 



+ 0(e2), 



wOb 40b Ob 



(3.520) 



(3.52i; 



(3.522) 



(3.523) 
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+ 



w 



c ■ \/VLb + 



-t ■ Fi ■ c 



B 



(3.524) 



In the above equations, the capitahzed variables are the guiding-center coordi- 
nates and the lower case variables are the particle coordinates; it is emphasized 
that all quantities on the right hand sides of these equations (e.g. unit vec- 
tors, field tensor, etc.) are evaluated at the particle coordinates. The inverse 
transformation is given by 



w 



and 



R + 



eWa 



B 



2Qb \^B ) ^B^B 



w 



P±) ■F^■c 



+t 
+h 
+c 



2KW 
L q2 



{ttc) 



— Yliata) — {etc)] 



80 



B 

-^[{aha) - (c6c)] 



80 



B 



KW 

-^[{htc) + {the) 



03 



c ■ VOs + 



K 



Xb^b 



B 

t ■ Fi ■ c 



02 



{ttc) 



KW W'^ 
+ - (etc)] + -^{aca) 



+a 



W 



2Xb^b 
K"^ , . KW 



a ■ Fi • c 



K 



Xb^b 



-t ■ Fi ■ a 



K-e 
+ 

B- 



02 

KW 



{tta) + ^^[3(cat) - (act)] J | + 0{t 



- Ob 
W 

X^ 



{tta) - 



"B 

W^ 



20 



B 



{act) 



t ■ Fi ■ a 



{acb) 



+ 0{e' 
W 



V2KVLb 

b ■ Fi ■ a 



B 



= W 



KX 

K^ 

— (tta) 



Ob 

+ 0{e' 
KW 



[{bta) - {atb)] 



4^^-[3(act)-(cat)] 

K ^ . W ^ ^ ^ 
+— t-Fi-a+-— a-Fi-c 

Ab ^Ab 



+ 0{e'), 



(3.525) 



(3.526) 



(3.527) 



(3.528) 
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e 



e-e 



r i^^ 



K 



[{ata) — {etc)] 



W 



(aca) 



lwn_ 
w ^ 



C ■ \/^B + 



40b 
K 



t-Fi-c +0(e2). 



(3.529) 



WXb 



In the above equations, everything on the right is evaluated at the guiding-center 
position. 

RecaU that the gyromomentum in guiding-center coordinates is given by 
mW^/2flB- In particle coordinates, this may be written 



This expression is useful because it gives the conserved quantity in terms of 
particle coordinates. 

3.13 Comparison with Three- Vector Formula- 
tions 

In order to compare our results with the three-vector formulation given by 
Northrop jlH]? we must be able to cast our results into "1 + 3" notation. 
We learned how to do this for the particle coordinates back at the end of 
Section 13.41 where we gave the explicit transformation equations, Eqs. ()3.357j1 
through fj3.363|l . These are scalar equations in phase space, and so they will 
retain their form under the guiding-center Lie transform. We need only to re- 
place {k,(3,w,9) by (K, W,0), and to reinterpret /3„ as the guiding- center 
three- velocity (divided by c). Then we can write down the equations of motion 
for by differentiating Eqs. fj3.357p through ()3.359|) with respect to proper 
time, using the known equations of motion for the guiding-center coordinates, 
and expressing the results back in terms of f3^ by using Eqs. ()3.36U|1 through 



A* part 




dssnsD- 
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The above-described program seems rather tedious. Fortunately, there are 
two things that we can do to simphfy the task. First, we need only check our 
results to the order of the classical drifts. This is the order given in the text by 
Northrop jl3] . Second, we can check our results in one of the "preferred" frames 
of reference, as were described back in Section 13.41 If they hold there, they have 
to hold in all other frames as well because our results are in manifestly covariant 
format. These two simplifications make the problem straightforward. 

First note that in a preferred frame (3^ — 0, so Eqs. fj3.357|l through ()3.363p 
become 

(3yi = tanhi? (3.531) 
VTsine , , 

p^, = (3.B33) 



K cosh B 



and 

K = c7„y/l-/32^ (3.534) 

B = tanh"^/3„i (3.535) 

W = nv^P'.2 + P'v3 (3-536) 

e = arg(-/3,3-^/5.2), (3.537) 

where, as noted in the last paragraph, all variables are now guiding- center vari- 
ables. In particular, the equations 

K cosh B = c-f^ (3.538) 

and 

KsinhB = c7„/3„i = 7„t'||, (3.539) 

where f|| = c/3„i, will turn out to be particularly useful. The quantity B is 
sometimes called the rapidity. 
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Next note that, in a preferred frame, the unit vectors that we constructed in 
Eqs. fl3.33()jl and (13.331;) can be inserted into Eqs. (13.343;) and (j3.345jl to yield 

(coshi3 \ - / sinhi3 \ 
, b = . (3.540) 

b sinh B I \ b cosh B I 

These wiU also be useful in what follows. 

Now examine Eq. fj3.5()8j) . We can consider the terms individually. First 

Ki =1 ^""^ I (3.541) 



follows immediately. Next 



- ^ ^ mcK ^ 

t- F" = t ■ Fi H 1 ■ Vt 

e 



^ mcK ( ,^19 , ^, „\ - 

t ■ Fi H coshi3-— +sinhi3b ■ V t 

e \ cat ) 

mcv\\ o / 

Vt+^l|b-Vb 



"^'lU . " I, (3.542) 



where 



E X B , , 

= c , (3.543) 



also follows after a short computation. Note that u^; vanishes in a preferred 
frame, but its derivatives may not; thus we had to apply the derivative to t 
before specializing to a preferred frame. 

Next we write the components of Fi as follows 

f El \ 
Fi = . (3.544) 

\^ El 1 X Bi y 

Recall that Ei must contain all of the parallel electric field. 
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It now follows from Eq. fl3.5()8|l that 



ci = C7„ + 0(e) 



(3.545) 



and 



R 



mc 



~dt 



+ V\\hT ■ Vby 



V dt 



m 



where 



B + eBi 



(3.546) 



(3.547) 



|B + eBi| 

Now take the perpendicular part of R by dotting it with 1 — bj-bT, then divide 
by i to get 



dR±_ 

dt 



B 



-br X |7„ 
fduE 

\ dt 



dhn 



dt 

+ V\\hT • VU£; 



+ V\\hT ■ Vbx 



+ 



m7„ 



VOs ^ + C(e2).(3.548) 



This is identical to Eq. (1.76) in the text by Northrop jlH] in a preferred frame. 
Recall that = S in a preferred frame, so that Qb in the above equation 
is simply eB/mc. The classical curvature, gradient and polarization drifts are 
readily visible in the above equation. If we had instead done the calculation for 
a general frame of reference, the E x B drift would appear as well. The reader 
is referred to Northrop |331 for a good discussion of these results. 
Next differentiate ■jvV\\ = K sinhB to get 



^(-f^vn) = —(k sinhB + KB coshB). 

dt 7^ 



(3.549) 



Insert Eqs. fj3.5U9|l and fj3.51U|l for K and B, respectively, and after a little 
algebra we find 



d 
di 



ilvV\\ 



1 



-^br • VOs - -Ivi ■ Fi ■ b ) + 0{e). 
m m 



(3.550) 
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Now it follows from Eq. fl3 .54411 that 

t . Fi ■ b = -b ■ El = , (3.551) 

So we finally have 

^AlvV\\) = ^hr ■ VOb + + 0{e). (3.552) 

at " rw-fy m " 

This is identical to Eq. (1.77) in the text by Northrop in a preferred frame. 
The terms on the right are the mirroring force and the force due to the parallel 
electric field, respectively. 

Northrop's Eq. (1.78) is immediately seen to be equivalent to the fact that our 
gyromomentum fi is a constant of the motion. Note that Northrop's magnetic 
moment is related to our as follows: Mj. = en/mc. 

Next, we know from Eq. fj3.538p that C7„ = K coshB, so 

(i 771 C 

— {mc^-fy) = — (i^coshi3 + i^sinhi3i3). (3.553) 
at 7„ 

Now use Eqs. ^TEm . ipmil and fpTHMll to get 

|(mc%.) = ^^^+ ev\\E\\ + (9(e) (3.554) 

after a short calculation. This is identical to Eq. (1.79) in the text by 
Northrop |13j in a preferred frame. 

Finally, as promised, we discuss the nonrelativistic limit of the second sub- 
term of the third term on the right side of Eq. fj3.511|l . This term is given by 
{K/4Xb)Fo : (Vt — tv)- To simplify the evaluation of this term, we special- 
ize to a preferred frame where the perpendicular electric field vanishes (though 
we shall be careful to retain its gradient). We also specialize to the case of 
time-independent fields, spatially uniform magnetic field, and zero parallel ve- 
locity. These assumptions are not at all necessary; they serve only to simplify 
an otherwise tedious calculation, to aid the reader in seeing an effect that would 
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otherwise be masked by lots of other less interesting terms, and to facilitate 
comparison with Appendix O Under these circumstances, we find that 




VKt = C7„ I _ I , (3.555) 

and 



^0 




1 X b 

It then follows after a short calculation that 



(3.556) 



^ -Fo : (Vt - tV) = ■ (V X /3e) = -77^7^ ■ E^. (3.557) 



4Ab ^ ' 2Xb ^ 2milB 

Except for the factor 7„, which is clearly a relativistic effect, this is identical 
to the gyrofrequency shift due to perpendicular electric fields that is derived 
in Appendix O This shift was discovered by Kaufman in I960, who also 
showed that it gives rise to the phenomenon of gyroviscosity. 

The reader is urged to consult the text by Northrop as well as a paper 
by Vandervoort a further discussion and alternative presentation of the 

above results. 



3.14 Manifestly Boost gauge and Gyrogauge In- 
variant Format 

The guiding-center equations of motion presented above contain expressions, 
such as Vt, that are not boostgauge or gyrogauge invariant. Of course, the 
equations as a whole are guaranteed to be gauge invariant by our method of 
derivation; but they are not manifestly so. This is due to the fact that our 
chosen coordinates, namely (i2, K, B, /U, G), are themselves not gauge invariant, 
thanks to the inclusion of B and G. This observation suggests that if we were 
to transform to a new set of gauge invariant coordinates, we could write our 
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results in manifestly gauge invariant format; that is, without any mention of the 
unit vectors, Gq,. In this section, we shall derive two new versions of the Poisson 
brackets: The first will be manifestly boostgauge invariant, but it will not be 
manifestly gyrogauge invariant. The second will be both manifestly boostgauge 
invariant and manifestly gyrogauge invariant. 

3.14.1 Manifest Boostgauge Invariance 

To get manifestly boostgauge invariant results, we would like to replace K and 
B by the new boostgauge invariant coordinate 

U = Ki. (3.558) 

The inverse transformation would then be 



K = (3.559) 

and 

i3 = tanh-if-^^44SV (3.560) 

Alas, there is a problem with this approach. Since the new coordinate U is 
a four vector, it contains four degrees of freedom, whereas K and B represent 
only two degrees of freedom. This discrepency stems from the fact that U is not 
an arbitrary four vector because it is constrained to lie in the parallel two flat; 
that is, it obeys the constraint equation 

P^{R)-U = Q. (3.561) 

This constraint restricts U to two degrees of freedom, but it also means that 
the coordinates R and U are no longer independent variables. The coordinate 
transformation is not a diffeomorphism (it is injective rather than bijective) and 
so we cannot proceed in the usual manner. 
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We can remedy this difficulty by temporarily relaxing the constraint in 
Eq. ()3.56H1 . We make the following coordinate transformation (where, for clar- 
ity, we use primes to distinguish the new coordinates): 

R' = R 

u' = i^t(i?,s) + -%-c(i?,e) + Ci,a(i?,e) 

e' = e. (3.562) 

The reason for including in the second term on the right hand side of the 
equation for U' will become clear in the next subsection. The inverse transfor- 
mation is then 

R = R' 



K = ^-U'-P\\{R')-U' 

B ^ tanh-^ (U'-^^jR') 
V U'-eo{R') 



e = e' 

Cia = \b{R')U' ■ c{R' 

Cib = U' -aiR^e'). (3.563) 

Here, U' is no longer constrained to lie in the parallel two- flat, and its perpendic- 
ular components are called Cia/Xs and Cu- In order to have the same number 
of variables before and after the transformation, we have appended Cia and Cib 
to our usual set of variables before making the transformation. 

We now have a diffeomorphism, but we still have to decide how to deal with 
these two new variables in the unprimed system. Our strategy will be to demand 
that they are Casimir functions. That way, the dynamics is constrained to lie 
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on hypersurfaces for which they both are constant. If we start the phase space 
trajectory on the hypersurface for which they are both zero, it wiU remain on 
that hypersurface. Of course, the equations of motion that we end up with will 
also be capable of describing dynamics on other hypersurfaces for which they 
are nonzero, but we ignore these other orbits as physically irrelevent. 

So our phase space coordinates before this transformation are now taken to 
be (-R, -ftT, i5, /U, O, Cia, Cib). The bracket relations among these coordinates are 
given by Eqs. ()3.497|1 through fj3.5n4|l for the brackets not involving Cia and 
Cib. Then, following the strategy discussed in the last paragraph, we simply say 
that the bracket of Cia or Cib with any of the other coordinates is zero. We now 
have dynamics in a ten dimensional phase space, but we are interested in what 
is going on only in the eight dimensional subspace defined by Cia = Cib = 0. 
We have simply imbedded the guiding-center dynamics in a higher dimensional 
phase space. It is clear that the Poisson bracket still obeys antisymmetry and 
the Jacobi identity. 

It is now straightforward to write the Poisson bracket relations among the 
new set of coordinates, (i?', C/', /u', O'). Once we are finished doing this, it will 
be alright to set Cia and Cu equal to zero, but not until we have taken every 
derivative that needs to be taken in the process; derivatives get messed up by 
coordinate transformations that are not diffeomorphisms. 

We illustrate this calculation for the {R', U'} bracket as follows: 

{R\U'} = {R,Ki} + {R,Ciac/XBiR)} + {R,CibSi} 
= {R,Ki} + {R,Cia}c/XB{R) + {R,Cib}si 

c/XB{R)}Cia + {RA}Clb 

= {R,K}i + K{R,R} -^1 + K{R,B}h. (3.564) 

Note that all quantities on the right hand side in the above equation are ex- 
pressed in the old coordinate system. Note also that all terms involving Cia 
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or Cib have vanished, either because they are bracketed with something (recaU 
that they are Casimir functions), or because they appear in a term outside of 
aU derivatives and so we have set them to zero. 

Eqs. (|3.497jl through (|3.499jl can now be substituted into the right hand side 
of Eq. ()3.564jl . The result will still contain objects such as Vt and Vb. Eliminate 
these by means of the easily verified relations 

Vt = (VP||) -t- Qb (3.565) 

Vb = (VP||)-b-Qt. (3.566) 

Because our results are guaranteed to be boostgauge invariant, all terms involv- 
ing Q will cancel, leaving a manifestly boostgauge invariant result. This being 
the case, the result can be expressed in terms of the new coordinates. 

Before presenting these results, a word of warning is in order. When the 
term i^(V-P||) ■ t is expressed in the new coordinates, the result is easily found 
to be 

K(VP||) - t = (v'P\\{R')^ ■P\\{R')-U'. (3.567) 

Upon applying the constraint, P\\{R') ■ U' can be replaced by simply U' . One 
might thus be tempted to pull the following dubious maneuver: 

{v'p\\{R')^ ■ P\\{R') ■ U' = {v'P\\{R')^ ■U' = V {P\\{R') ■ U') = V U' = 0. 

(3.568) 

This is incorrect because after the constraint is applied^ R' and U' are no longer 

_>/ 

independent variables. We thus had no right to pull U' inside the V operator, 

_>/ 

nor did we have a right to say that S/ U' = 0. This is subtle but important, as 
the brackets below are full of things that look like P\\{P')^ ' U', and they 
are definitely not zero. 

We now present the full set of brackets in the new coordinate system (omit- 
ting the primes since ambiguity should no longer result from doing so). We 
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find 



{R.R} 
{R, U} 

{i?,e} 



mX s f2s T 

{t/,e} 

and 
{6,^} 

where we have defined 



m " mA^T 



2 x/^o 



F-.p,|-!^(VP||-t/) 



(classical order) 

e{R, R} ■ TZ (higher order) , 
Ob 



2mX% 



1 

m 



(classical order) 

e{U, R} ■ IZ (higher order), 



^F'" ■ Fq 



mc f 



F'" = F' + — ( {VP\\ ■ U) - (vPy ■ Uy 



(3.575) 

(3.576) 
(3.577) 



where F' was defined in Eq. (|3.495jl . and where the superscripted T means 
"transpose." Note that T', S' and F'" are the boostgauge invariant portions of 
T, S and F"; that is, they are related by 



T = r. 



(3.578) 



(3.579) 
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and 



F" = F"'+'^{hQ-Qh). 



(3.580) 



These new brackets may be compared to those for the old coordinates, given in 

Eqs. (nrwji to (USD. 

This Poisson structure has the Casimir function, P± ■ U, so the constraint 
Eq. fj3.561|l is guaranteed to hold for all times if it holds initially. The physical 
motion takes place on the hypersurface for which this Casimir function has the 
value zero. 

The guiding-center Hamiltonian, Eq. ()3.488|] . can now be expressed in the 
new boostgauge invariant coordinates: 



(-^) Fo-.Fi + P^: ((VP± ■U)-Fo) (3.581) 



Note that this Hamiltonian is also gyrogauge invariant, since it does not involve 

e. 

There is another way to derive the above manifestly boostgauge invariant 
Poisson brackets. We can write the phase space Lagrangian corresponding to 
Eq. ()3.489p in manifestly boostgauge invariant form as follows: 



Lge(i?, u, fi, e, R, e) 



—A + mU - eun + 0(e 
ec 



■ R + e^Q 

-XiaU-c{R,e)-x,,u-k{R,e) 

-Hg,{R,U,ij). (3.582) 

The action associated with this Lagrangian may be varied to yield the same 
equations of motion given by the manifestly boostgauge invariant brackets and 
Hamiltonian, but the variation of the action must be performed subject to the 
constraint, Eq. fj3.561|l . Hence we have introduced the Lagrange multipliers, Aia 
and Xib- Note that varying an action subject to a constraint causes the constraint 
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to appear as a Casimir of the resulting Poisson structure; recall the example of 
this phenomenon given in Subsection 12.3. 6L 

The equations of motion in this coordinate system are then easily found 
either by using the Poisson brackets given in Eqs. fj3.569p through fj3.574|l with 
the Hamiltonian given in Eq. (|3.581jl , or by finding the Euler-Lagrange equations 
from the phase space Lagrangian given in Eq. fj3.582|) . The results are 

^ _ jj _ eidFp ■ VOb 

+ ^^0 ■ {f'" . P|| - ^(VP|| ■U))-U (3.583) 



U 



^P||-VOb + 



n 



B 



2A| 



-^0 : (F 



^B 
III '-■iT-' 



Fo ■ V^B 



P|l • E' ■ (VP|| ■ U) - (VP|| • U) 



T '^iT 



Xb^bT' 



(VPi, ■ Uf ■ Fo ■ (VPi, ■ U) 



Pii 



u 



u 



e 



- + en-R+ ^ 



2A 



B 



mc 



Fo■.F^ + P^: ({VP±-U)-Fo 



(3.584) 

(3.585) 
(3.586) 

These equations of motion may be compared term for term with Eqs. fj3.5U8|l 
through fj3.51Hl . In the equation for R, note that the parallel motion is given 
simply by U. The second term contains the grad-S drift, and the third term 
contains the curvature and polarization drifts. The first term of U contains the 
mirroring force, and the force due to the parallel electric field arises from the 
terms that contain Fi (via their dependence on F'"). Of course, fi still vanishes, 
and the equation for O compares term for term with Eq. ()3.511|1 in an obvious 
way. 
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3.14.2 Manifest Boostgauge and Gyrogauge Invariance 

Now we can use the same techniques to make our results gyrogauge invariant as 
well. To do this, we would like to replace the coordinate O by the new coordinate 

a = a(i?,e). (3.587) 

The inverse transformation would then be 



e = arctan ( -^^4^ ) • (3-588) 



a ■ e2(-R), 

Note that a, like O, has only one degree of freedom, even though it is a four 
vector. This is because it is subject to the constraints 

P||(i?)-d = 0, (3.589) 

and 

a-a = l. (3.590) 

In order to deal with this in a proper fashion, we have to use the same 
techniques that we used above to get boostgauge invariant brackets. Write the 
coordinate transformation 

R' ^ R 

U' = Ki{R,B) + ^(-^c{R,e) + CuSi{R,e)] 

V<^3 \^B{R) ) 

a' = ^/C~sHR.Q) + ^(^-C2akR.}3) + ^^HR,B)^ (3.591) 
The inverse transformation is then 

R = R' 



K = ^-U' ■ P\\{R')-U' 
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// = ^' 

e = arctan (-?75{§j) 

Cia = U'-Fo{R')-a' 

Cu = U'-P±{R')-a' 

C2a = U' ■ P\\{R') ■ a' 

C2b = U' ■ T{R') ■ a' 

C3 = a-P±{R')-a' (3.592) 

We demand that Cia, Cu, C2a, C2b and C3 are Casimir functions, and that the 
physical motion takes place on the submanifold defined hj Cia = Cib = C2a = 
C2b = and C3 = 1. 

We can now write the Poisson bracket relations among the new coordinates. 
We use the easily verified relations 

Vt = (VP||) ■t + Qb (3.593) 

Vb = (Vi'll) -b- Qt (3.594) 

Vc= (VP±) ■c + 7^a (3.595) 

Va - {VP±) -a- Tie. (3.596) 

Note that, because our results are guaranteed to be both boostgauge and gyro- 
gauge invariant, all terms involving Q and TZ will cancel, leaving a manifestly 
boostgauge and gyrogauge invariant result. Also note that the Hamiltonian 
Hgc{R,U, y), given by Eq. fl3.581|l . is already manifestly gyrogauge invariant 
(this is because it is 0-independent) . The new manifestly boostgauge and gyro- 
gauge invariant brackets are then 

{R,U} = -P\\ + ;;j^Fo-[F"'.P^^-—{vPrU)], (3.598) 
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and 



-^(^'ir = ■(vP|rC/)-(vPrf/r-H'^-P|i) 

. (classical order) 
{R,a} = { ^ (3.600) 

e{R, R} ■ V-Pl ■ a (higher order), 

, (classical order) 
{U,a} = { ^ (3.601) 

e{C/, i?} ■ V-Pi ■ & (higher order), 

{a,^} = -^Fo-a, (3.602) 



= ^-^^{VP±-af ■Fo-{VP±-a). (3.603) 



This Poisson structure has the Casimir functions. Pi ■ U, Py ■ a, and a ■ a. This 
insures that the constraint Eqs. (|3.56ip . (|3.589jl and (|3.590|1 will hold at all times 
if they hold initially. The physical motion takes place on the hypersurface for 
which the first two of these Casimir functions have the value zero and the third 
has the value one. 

Note that d, like O, has nonvanishing brackets with R and U at higher order. 
Once again, however, the set of functions of R and U form a subset of the set of 
all possible phase functions that is closed under the operation of these Poisson 
brackets; also, Hgc is independent of a. So we can still reduce to the guiding- 
center description. 

Next, we note that these results could have been derived by varying the action 
corresponding to the phase space Lagrangian obtained by rewriting Eq. ()3.489p 
in manifestly boostgauge and gyrogauge invariant format. 



Lgc 



-A + mU + 0(e^) 
Lec 



■ R — - — a ■ ro ■ a 

As 



-XiaU ■ Fo-a - XibU ■ P_L-a 
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-X2aU ■ P\\ ■ a — \2hU ■ J^Q- a 



(3.604) 



This must be varied subject to the constraints, Eqs. (|3.561|1 . (|3.589|1 and f|3.59Ujl . 
We have enforced these constraints by introducing the scalar Lagrange multi- 
phers, Ala, Ai;,, A2a, A2b, and A3. Note that the term involving IZ has disap- 
peared from Vgc when written in these coordinates, because —lilZ ■ R + /xG = 
— /XQ! ■ Fq ■ a/ \b- 

We are going to need these Lagrange multipliers in Chapter El so we compute 
them here for reference. They are rather easy to calculate, especially since we al- 
ready know the Poisson brackets. The Euler-Lagrange equations for coordinates 
U and a are 



= mR — Aia-fo ■ d — \ibPi_ ■ OL 



dH, 



dU 



(3.605) 



and 



a ■ Fo 



-^Fo ■ & + MaU ■ Pn + \2bU ■ To + 2A3P± ■ a + — ^, (3.606) 
\b oa 



dr \Xb 

respectively. Upon multiplication by d ■ Fq and d, the first of these yields 



X^a--^a.Fo■[mR-^ 



(3.607) 



and 



A 



ib = a - { mR 



(3.608) 



respectively. Upon multiplication hy U, U ■ Tq and d, the second yields 









tS) 



and 



and 



A 



A, 



26 



IT" 

As 



1 








e/ud ■ 





y.R-—^ 
oa 



y.R-—^ 
oa 

dH. 



V -R 



•-gc 



da 



(3.609) 



(3.610) 



(3.611) 
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respectively. Note that, in perfect analogy with Eq. fl2.184|l . these results can 
be cast in the form 

A. = c||f, (3.612) 

where the label u runs over all the constraints present (la, lb, 2a, 2b, 3), and 
where 



and 



and 



and 



Sla 
Sl( 
Sl( 



-—a-Fo-{R,R} 

A o 



TTl ( 1 

-—a-F^-[{R,U}--\ 




-^a ■ Fq ■ {R,a\ 



B 



sib 


= moL ■ 


tU 
Sib 


= ma ■ 


Sib 


= 


COL 

Sib 


= ma ■ 


pR 
S2a 


- > 


S2a 


- > 


S2a 


= 


pa 
S2a 


- > 


ti? 
S2b 


- > 


S2b 




S2b 


= 



m 



efia 
efxa 







efxa ■ 





^]V-{R,R} 
f^]V-{R,U} 



efia ■ 
efia ■ 



^]V{R,R} 
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and 



^3^ 



^3 



S3 



S3 



-—a ■ Fq ■ {a,R} + -a 
Xb ^ 

-—a ■ Fo ■ {a, (7| + -a 
As z 

As 

— - — a ■ tQ ■ ja, aj 
Ab 



e^ud ■ ( ^ ) V ■ {i?, i?} 



e^d . ( ^ ) V ■ {i?, t/} 



1 










m 



.(3.613) 



Finally, we note that the equations of motion in these coordinates are easily 
found either by using the Poisson brackets given in Eqs. fj3.597p through ()3.6U3|1 
with the Hamiltonian given in Eq. (13.58111 . or by finding the Euler-Lagrange 
equations from the phase space Lagrangian given in Eq. ()3.6U4|) . The results are 



R 



U 



' F 







Pii 



mc 



{VPwU))-U (3.614) 



U 



-^PWV^B + ^^ 



F'" ■ Pn 



mc 



n 



B 



^B 
III '^iT\ 



B 



2X1 



Tp : {F'" . E'^ )To ■ U 



Pii-s'-(vP||-t/)-(vP||-c/)' -E" - Pii 

e 



■ U 



(VP|| ■ Uf ■ Fo ■ (VP|| ■ U) 
/t = 



u 



(3.615) 
(3.616) 



a 



emc 



-Fo-d+d-(PxV)-P.-77T2-"-^o 



(—) Fo-.Fi + P^: ({VP± ■U)-Fo 
\mc/ \ 

(3.617) 



Note that Eqs. fj3.614|) through ()3.616|1 are identical to the corresponding equa- 
tions in the last subsection. These were gyrogauge invariant anyway, and so were 
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unaffected by the manipulations carried out in this subsection. The equation for 
Q has been replaced by an equation for a; the two may, however, be compared 
term for term in an obvious way. 



Chapter 4 

Relativistic 

Oscillation-Center Theory 

4.1 Discussion 

In this chapter, we shaU consider the perturbation of a guiding center due to the 
presence of an electromagnetic wave of eikonal form. In doing so, we shaU take 
as our unperturbed problem the guiding-center equations of motion, as derived 
in Chapter El Thus we are effectively using the superconvergent Lie transform 
procedure as described in Subsection 12.4. 11 

We are interested in understanding the response of the guiding center to the 
presence of the wave. Towards this end, we seek a transformation to a new 
system of coordinates in which the wave perturbation is removed. Neglecting 
resonant phenomena, it turns out that it is possible to do this to first order, but 
not to second order. At second order, there remains an averaged residual pertur- 
bation to the Hamiltonian that gives rise to the ponderomotive force exerted by 
the wave on the guiding center. Thus, after we transform away the rapid fluctu- 
ations in the guiding-center motion, we are left with the slower ponderomotive 
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effects. 

An analogy with the guiding-center problem may be helpful here. In that cal- 
culation, we averaged over the rapid gyromotion to find the slower drift motion. 
The thing that is drifting is then called a "guiding center." A guiding center 
is a fictitious object whose position and momentum are the gyroaverage of the 
particle position and momentum, respectively. Furthermore, a guiding center 
may be thought of as having an intrinsic or spin angular momentum equal to 
the orbital angular momentum of the underlying gyrating particle. Thus, by 
finding the averaging transformation that eliminates the fast degree of freedom, 
we have discovered a new "macroparticle" that lives on the slow time scale, but 
whose properties derive from those of the original charged particle gyrating on 
the fast time scale. 

Similarly, when a perturbing wave is present and we transform away the as- 
sociated rapid fluctuations, the residual ponderomotive forces may be thought 
of as acting on a new "macroparticle" that is averaged over a wave oscillation 
time scale. We call this new object an "oscillation center." Whereas an individ- 
ual charged particle feels wave fluctuations on a rapid time scale, an oscillation 
center feels only the slower ponderomotive effects; it also feels resonant effects 
(since these are also slow and do not average away), but we shall ignore these 
in our treatment. Thus, a kinetic equation for a plasma of oscillation centers 
would contain only ponderomotive forces and resonant effects. 

The averaged nth-order part of the ponderomotive Hamiltonian is called Kn, 
and we shall derive this for a relativistic guiding center. As has already been 
noted, Ki vanishes if we neglect resonant effects. It was discovered by Gary and 
Kaufman that there exists an intimate connection between the ponderomotive 
Hamiltonian and the plasma's response to a wave. Specifically, K2 is a quadratic 
form in the amplitude of the perturbing wave, and the kernel of this quadratic 
form is the functional derivative of the linear susceptibility with respect to the 
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distribution function. Subsequently, it was found by Kaufman that this relation- 
ship persists to higher order; that is, nonlinear corrections to the susceptibility 
are related to Ks, etc. 

In the traditional approach to studying plasma response to a wave, one be- 
gins with the field equations and the kinetic equation, and studies perturbations 
in the fields and the distribution function about an equilibrium. Though this 
approach is not as systematic as ours, it has at least one advantage: The vector 
potential never appears, so all results obtained by such an analysis are guaran- 
teed to be manifestly gauge invariant. In contrast, Hamiltonian or Lagrangian 
approaches to ponderomotive theory seem to require the use of the vector po- 
tential, so past attempts along these lines have produced results whose gauge 
invariance was either not established, or established only by laborious calculation 
after the fact. 

In this chapter, we shall find that eikonal wave perturbations to the La- 
grangian action for a relativistic charged particle in the guiding-center repre- 
sentation can be written in manifestly gauge-invariant form. To do this, it is 
necessary to abandon the usual approach of expanding the eikonal wave per- 
turbation in a series of Bessel functions of k±p. Instead, we first perform a 
Lagrangian gauge transformation, and then we expand in a series of functions 
that are related to indefinite integrals of Bessel functions. This allows us to de- 
velop an oscillation-center theory to arbitrarily high order in the wave amplitude 
expansion parameter, and be guaranteed of manifest gauge invariance at every 
step of the way. Thus, we can enjoy the benefits of the systematic Lie trans- 
form approach to ponderomotive theory without fear of losing manifest gauge 
invariance. 
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4.2 Eikonal Wave Perturbation 



In single-particle phase space coordinates, an eikonal wave has a four potential 
of the form 

A^{r) = A{r) exp (^-i{j{r)^ + c.c, (4.618) 

where A is the amplitude and i/^ is the phase, and where c.c. denotes the expres- 
sion's complex conjugate. The derivative of i/j with respect to spacetime position 
is the four wavevector, k: 

k = V'0(r). (4.619) 

Both A and k are slowly varying functions of r. That is, an eikonal wave is locally 
a plane wave. To reflect this, we have placed 1/e in front of the phase. Thus, 
the derivative of Ayj with respect to r is ikA^j/e plus terms of order unity that 
involve derivatives of A or of k. 

Furthermore, in this work, we shall take this eikonal expansion parameter to 
be equal to the guiding-center expansion parameter (hence, it is no coincidence 
that we are calling it e). This means that we are considering waves whose char- 
acteristic wavelengths are on the order of a gyroradius, and whose characteristic 
frequencies are on the order of a gyrofrequency. 

We shall now consider the effect of such a wave on the single particle action 
one form in Eq. ()3. 37011 . Replacing A in that equation hy A-\- XA^, we write 

7 = 7" + A7^, (4.620) 

where is the perturbation in the action one form due to the wave, or 

'jyj = -A(r) ■ drexp I -ijjir) ) + c.c. (4.621) 
c V J 

Note that we have introduced a new expansion parameter. A, to order the wave 
amplitude. For the time being, we shall not compare A and e, though more will 
be said about this later. 
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As was remarked earlier, our starting point for the oscillation center Lie 
transform will be the guiding-center equations of motion. Hence, it is necessary 
to write 'y^ in guiding-center coordinates (the above form for is in particle 
coordinates). We apply the guiding-center Lie transform to the above equation 
for 7 to get 

T = Tg, + XT^, (4.622) 

where Tgc is the guiding-center action one form, calculated in Chapter El Then, 
is given by 

r^, = exp(-e£g)7^^, (4.623) 

where g is the generator for the guiding-center transformation. 

Note that we are working only to first order in e. To this order we can take 
= —pa, where p = w/Qb- All other components of g are unnecessary, and may 

be ignored. We shall use the boostgauge invariant set of coordinates {R, U, p, 9) 

described in Section 13. 14L 



4.3 Manifest Gauge Invariance 

At this point in the calculation, the usual approach is to apply the Lie transform 
in Eq. fl4.623|l by simply substituting R + pa for r in Eq. ()4. 62111 . This is 
straightforward, and the result is 

= - I A-dR + + epA ■ cdO] exp \-ip \ exp(zpa-k) -fee, (4.624) 

c \^ mpVLB j \e J 

where we have retained the leading nonvanishing order for each component of 
the one form, and where it is understood that all quantities on the right (such as 
A and c) are now evaluated at R. Since a-k is oscillatory, the second exponential 
in the above expression gives rise to a series of Bessel functions of k±p. 

Unfortunately, the above expression for does not possess manifest gauge 
invariance. To understand why this is, we must qualify what we mean by "man- 
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ifest gauge invariance." A term in the action one form is gauge invariant if it is 
unchanged to within a Lagrangian gauge transformation when A is replaced by 
A + zkA, where A is any slowly varying scalar function of position. Thus, the 
quantity 



is gauge invariant since it is unchanged by this transformation. The quantity 
A - dRexp{ii(j/e) is also gauge invariant since it transforms to itself plus the term 



(where we have neglected higher-order terms in e) , and this can be removed by a 
Lagrangian gauge transformation. We shall say that a term is manifestly gauge 
invariant if it has the form A ■ dRexp^itl^/e), or if it depends on A only through 
its dependence on F. 

Thus the first term on the right hand side of Eq. (|4.624|1 is manifestly gauge 
invariant, but the other two terms are not. They are gauge invariant (as they 
must be), since to leading order in e we have 



and this can be removed by a Lagrangian gauge transformation, but they are 
not manifestly gauge invariant. 

If we were to use Eq. fj4.624|) as the starting point for our ponderomotive 
theory, we would obtain results for that are not manifestly gauge invari- 
ant. We could get around this problem if there were some way of manipulating 
Eq. fj4.624p into manifestly gauge- invariant form. It turns out that this can be 
done by making a particular Lagrangian gauge transformation, but this trans- 
formation is far from obvious and needs to be motivated. As we shall now see, 
this motivation comes from the homotopy formula. 



F = i{kA - Ak) 



(4.625) 




(4.626) 
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Return to Eq. fl4.623j1 . and expand the exponential in a series of Lie deriva- 
tives 



J! 



(4.628) 



Applying the generalized homotopy formula, Eq. ()2.76p . we get 



[{igdy + {digY] 7, 



(4.629) 



Note that we have split off the j = term from the sum because Eq. (|2.76jl is 
valid only for j > 1. The above may now be written in the suggestive form 



{igdy-f^ 



+ d 



1v 



(4.630) 



Note that the second term in square brackets is an exact one form, and may 
therefore be removed by a Lagrangian gauge transformation. The first term in 
square brackets has two pieces: The first is 'y^ itself, which we know is manifestly 
gauge invariant. The second is a series of terms all of which have the operator 
igd, raised to some power, operating on 7^^. Thus, in all these terms, the very 
first operator to be applied to ■jw is the exterior derivative. Now 



6 ~ ft 

d'lw = F : dr A drexp I -i(;(r) ] + c.c. 

2ec V e ' 



(4.631) 



(plus higher-order terms), and this is manifestly gauge invariant. Subsequent 
applications of ig and d preserve this manifest gauge invariance. Thus the term 
in the first square brackets on the right hand side of Eq. fj4.630|l is manifestly 
gauge invariant. Thus, Eq. fl4.63()|l gives us the Lagrangian gauge transformation 
that leaves F^^ in manifestly gauge invariant form. 

At this point, one may wonder why we have bothered to keep all the terms 
in the above series when we have said that we are interested in only the lowest 
nonvanishing order in e. Note that when we apply differential operators to 'y^, as 
given by Eq. fj4.62Hl . we pull out factors of 1/e. This means that even terms with 
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very high j can make order unity contributions. Thus, it is important to keep 
all the terms of the series as given above. This situation arises as a consequence 
of the nonanalyticity of in e. It wiU become more clear momentarily. 

To proceed, we need expressions for {igdY'-yw and igidigY'^w, for j > 1. To 
get such expressions, we simply evaluate them for the first few values of j, notice 
the pattern, and prove it by mathematical induction. The results are 





le , 
c 




= -0 


ec 




= -0 

ec 


and 




igidigY'jw 


e f i 



(^7'^-k)^-V-^exp(-V) + 



ig'' ■ k 



ig'' ■ k 



i-2 



J-2 



dl_ 
djj. 

de 



■ F ■ exp -i/; + C 



+ c.c. 

1 



F-/exp [-tl)\+0 



1 



(^7^-k)V-^exp(-V)+0 



1 



+ c.c. 



+ c.c. 



+ c.c. 



(4.632) 



Note that the components of {igdY'^w are manifestly gauge invariant, as 
promised. Then igidigYj^ is not manifestly gauge invariant, but this is the 
term that will be removed by the Lagrangian gauge transformation. Thus, ev- 
erything is going as planned. 

Now we must plug the above results into Eq. fj4.63Up , and sum the series over 
j. This is straightforward, and the result is 

' exp{—ig'^ ■ k) — 1 



-ig^ ■ F 



(7--k 



exp \-^]+ 0{e 



■ dR 



e 

+ - 

c 






e 

+ - 

c 


dg- 


^ de 


-d 


iee „ ~ 
—g'^-A 
c 



■F.g^ 



(1 + z^^-k) exp(-z^'^ • k) - 1 

(T^kp 
(1 + z^^-k) exp(-z£?^ ■ k) - 1 



exp \-^]+ 0{e 



k)2 



exp ( -^A ) + 0(e 



exp(— ■ k) — 1 



exp ( -^A ) + C(e' 



+ c.c. 



(7'--k 

At this point, we can check the above result by actually applying the exterior 
derivative to the last term in square brackets. There is extensive cancellation. 



dn 

de 

(4.633) 
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and we are left with Eq. ()4 .62411 . as expected. We can now make the Lagrangian 
gauge transformation, 



where 



_ zee ^ ~ / expi-ig"^ ■ k) - 1 

= 9 ■ A — j 



exp ( -ip \ + c.c. 



(4.634) 



(4.635) 



thereby removing the last term of Eq. ()4.633p to get a manifestly gauge invariant 
one form, as desired. 

Now g'^ = —pa, and we can substitute this into Eq. fl4.633|l . Note that the p 
component of T'^ vanishes because and dg^ /dfi are both in the a direction, 
and they are both dotted into the antisymmetric two form, F. The 9 component 
does not vanish, however, because dg^ /dO is in the c direction. We finally have 

exp(zpa ■ k) — 1' 



A + pk-F 

2 



ipk ■ k 



+ 0{t 



dRexp -ifj 



eep^ Fo • F f (I'^P^-^^) exp(zpa ■ k) -1 | ^ ^^^^^ 



2cXb'" " ' V (pa-k)2 
+0{e^)dp + 0{e^) ■ dU + c.c. 



dOexp ( -if; 



(4.636) 



To proceed, we must Fourier analyze the above expression in preparation for the 
oscillation-center Lie transformation. 



4.4 Fourier Expansion in Gyroangle 

We now write the components of k in the basis, introduced back in Chapter IHl 
as follows: 

k = k|| — k^(e2 sina + 63 cos a), (4.637) 

where ky lies entirely within the parallel two- flat. The geometrical situation is 
illustrated schematically in Fig. 14.11 Then, using Eq. fj3.346|) , we find 

a-k = kisin(^-a). (4.638) 
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Figure 4.1: Components of the Four Wavevector 
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Now we may Fourier Expand the quantities 



^J,(k^p)e^'(^-°), (4.639) 



exp(zpa ■ k) - 1 _ e*^''"^"^^-") - 1 
ipk-h ik±psm{9 — a) 

= 5^QKk±p)e^^(^-") (4.640) 



and 



;i - ipa ■ k) exp(zpa ■ k) - 1 (1 - zkipsin(^ - a)) e^kipsm(e-a) _ ^ 



(pa-k)2 klp^sm^{e-a) 

= ^5^i?,(k±p)e^'(^-"\ (4.641) 



where the Ji are Bessel functions, 



and 



Je{x) = — / d^e*"^^"^-*^^ (4.642) 
27r Jo 

where we have defined the special functions 

1 f'^'^ /pia: sing _ 1 \ 

TT Jo V a;^ sm 4 / 

and where the summations over i extend from minus infinity to infinity. The 

properties of the Q and R functions will be explored in detail in Appendix iDl 
Now, along with the expressions in Eqs. fj4.639p . fj4.64U|l and fj4.641|) . and 
also contains the 6'-dependent (and hence oscillatory) quantities, c and a. 

Thus we need to know how to Fourier expand these as well. Using Eq. fj3.346p . 

we may write 

c = ^(e+e*^ - e_e-^^) (4.645) 
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and 



where we have defined 



a = ;^(e+e'^ + e.e"^^), (4.646) 



e± = -L(e2±ze3). (4.647) 

Note that these are complex unit vectors that obey = ezp, ej- ■ e± = 0, and 
■ e-i- = 1. Because they contain e^*^, when we multiply them by the series 
in Eqs. (j4.6:Wj) . (I4.64()j) and (14.641 j) . they wiU generate terms with e'^^^^^^. By 
defining new summation variables we can restore these to the form e*^^, but then 
these terms will be left with special functions that have indices i ±1. 
Now then, we may write as follows: 

= ^(r^i? ■ dR + eV^^dii + eV^ede) exp {U>}j + ^(e^) + c.c, (4.648) 



where 



and 



V^R = ^JiA, (4.649) 



Tee = ^Jf ■ i, (4.651) 
V2c 



and where we have defined 

^e{R,0) = ij{R) + ee{e-a{R)) (4.652) 

and 

J-± = e+e^«j£_i ± e_e-^« J,+i. (4.653) 
Similarly, we may write as follows: 

= ■ dR + eT',,de) exp (^^vl/,^ + 0{e^) + c.c, (4.654) 



Relativistic Oscillation-Center Theory 



167 



where 



and 



J- e 



v2 



V' 

-L 



and where we have defined 



4cA 



B 



-ReFo : F + 0{e^ 



(4.655) 



(4.656) 



(4.657) 



In the above expressions, it is understood that J^, Qi and Ri are evaluated at 
k_Lp. 

FinaUy, note that St-, as defined by Eq. (|4.635p . has the Fourier decomposi- 
tion, 

St = e^STiexp (-"^A + c.c. (4.658) 



where 



S 



re 



Qt ■ A. 



V2i 



(4.659) 



Using Eqs. (|4. 64811 . (j4. 65411 and (|4.658ll . it is possible to check that = + 
(ISt- 



4.5 The Oscillation-Center Lie Transform 

Our aim is to perform a Lie transform that will remove all the effects of the wave 
from the Poisson structure, and put them into the Hamiltonian. Thus, when we 
have completed this task, our Poisson brackets will be identical to those for a 
guiding center with no wave present (through order A^). The effect of the wave 
will be pushed into a term of order in the Hamiltonian. We shall do this both 
for and for F^, in order to verify that we get the same answer either way. 

We now reset our variables, so that F^ (as given by Eq. fj4.648|l ) and F^ 
(as given by Eq. ()4.654p ) will henceforth be called 'y^ and respectively. The 
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oscillation-center transform will take us to and F^, but we want these to 
vanish by the above argument. Thus, in Eqs. fl2.273|l and fl2.274|l we demand that 
Fi and F2 vanish. This is the step at which we are neglecting resonant effects. 
Furthermore, in Eq. (|2.274jl we have 72 = because our wave perturbation is at 
first order in A only, and Oi = because Fi = 0. 

First consider the oscillation-center transform of 7^1;. We have 

= 'jw - ii(^gc + dSi, (4.660) 

and 

= ~i2UJgc - ^iiujyj + dS2. (4.661) 

Meanwhile, the Hamiltonian transforms according to Eqs. fj2.221|) through 
to give 

Ko = ifgc, (4.662) 

Ki = -CiHg, = -iidHg,, (4.663) 

and 

K2 = -C^Hg, + ^CjHg,. (4.664) 

Now we demand that Ki = —iidHgc = 0. Let zq denote interior multiplica- 
tion by z (the unperturbed flow), so ioujgc = —dHgc (our unperturbed problem 
is the guiding-center problem). Then, applying io to Eq. ()4.66U|1 gives 

5"! = -io^w + hdHgc = -io'jw, (4.665) 

where the last step follows as a result of our demand that Ki = 0. We can 
integrate this last equation along unperturbed orbits to get 5"!. Then gi is given 
by Eq. ^nm^ 

91 = hu. + dSi) ■ Jg,. (4.666) 
At second order, we can solve Eq. (14.66111 for (72 as follows: 

92 = {-^ii^^w + dS2) ■ Jgc- (4.667) 
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Now then, we can insert these generators into Eq. ()4.664j1 to get K2, as foUows: 



K2 = —^2Hgc + -CfHgc 



1 

2' 

-C2Hgc 



— {-h<^w — dS2) ■ Jgc ■ dHgc 

= ^ioiiUJw - S2. (4.668) 

Now we can choose 5*2 to remove the osciUatory part of the first term. Note 
that we cannot remove the averaged part of the first term, because that would 
introduce secular terms in 5*2. So the best that we can do is to take 

K2 = (liohu;^) . (4.669) 



2 

This is the ponderomotive Hamiltonian. 

Now suppose that we had started with 7^ = 7^^ -\-dST instead of •jw Instead 
of Eqs. fj4.66mi and ()4.66H1 . we would have written 

0^y^-iru;gc + dS[, (4.670) 

and 

= -i2'UJgc- ^ii'UJw + dS2, (4.671) 

where in' is an obvious shorthand for ig'^, and where we are adhering to the 
convention of using primes to denote quantities arising from the Lagrangian 
gauge transformed action one form. Of course, we still would have taken Kq = 
Kq = Hgc and we still would have demanded that K[ = —ii'dHgc = = 
Ki. From this it follows that K2 = —i2'dHgc. Thus, if we could show that 
92 = 921 it would immediately follow that K2 = K2; that is, it would follow 
that the ponderomotive Hamiltonian is invariant under the Lagrangian gauge 
transformation. 

From Eq. fj4.67U|l . we have 

S[ = iodS[ = -iolL + h'dHgc = -iolw = -ioilw + dSr) = Si - St, (4.672) 
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so 

S[ = Si- St. (4.673) 

Then 

9i = {l'w+dS[)-Jgc = {jui+dST+dSi-dST)-Jgc = {lw+dSi)-Jgc = gi, (4.674) 

So gi is invariant under the Lagrangian gauge transformation. Next, from 
Eq. ()4. 67111 we have 

fi'2 = (^-^h'(^w + dS2^ ■ Jgc, (4.675) 



so 



K2 = (^ii'UJy, - dS'^ ■ Jgc ■ dHgc = ^^0^'^^^, -82 = ^ioiiuj^ - (4.676) 



Thus we have 



so 



S2 = 7;^oh(^w - (7;^oii(^w) = S2, (4.677) 



2 " ^ \2 



S'2 = S2, (4.678) 



and so 



g'2 = g2. (4.679) 

It immediately follows that 

K!^ = (^^ioiiu^'j = K2, (4.680) 

so the ponderomotive Hamiltonian is indeed invariant under the Lagrangian 
gauge transformation. Note that gi, (72, and S2 are also thus invariant, but that 
7^ and Si are not. The latter two quantities transform under the Lagrangian 
gauge transformation as follows: 

'yL='yn.+dST (4.681) 
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and 

S[ = Si-St, (4.682) 

so that the combination + dSi is invariant. 

Though we have just shown that we would get the same answer for the 
ponderomotive Hamiltonian either way, it bears repeating that the advantage of 
starting with 7^ is its manifest gauge invariance. In the next section, we shah 
further discuss the relative merits of each of the two ways of calculating K2. 

While the above expression, Eq. (|4.669jl . for the ponderomotive Hamiltonian 
is wonderfully compact, it is also very formal. We need to plug in Eq. ()4.648p 
and/or Eq. ()4.654p . and work it out in detail. This is done in the next section. 

4.6 The Ponderomotive Hamiltonian 

Our unperturbed equations of motion are 

R = U + eUd 
U = 0{1) 
fi = 

and 

e = -Qb, (4.683) 
e 

where Ud denotes the guiding-center drift motion, and where we do not need 
to know anything about U other than the fact that it is order unity in e. Then 
Eq. ()4.665p for Si becomes 

and Eq. ()4.672|1 for S'l becomes 

S'i = -Yl ■ + ^^^) + ^^^"«] f-"^^) + (4.685) 
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Integrating over unperturbed orbits, we get 

Si = iey2 biR ■ (U + eUd) + lie^e] """"^f?^'^ + ex., (4.686) 



and 

e ^ 
respectively. Here we have defined the resonant denominator 



S{ = ^eY. Wi? ■ (t^ + ^^d) + iiM ^^PiiM + c.c, (4.687) 



D£ ^^'^ = k- (t/ + et/d)+£OB. (4.688) 

Using Eas. (|4.658|l . (j4.686|l and (|4.687|1 . it is possible to verify Eq. (j4.67;-Sll : that 
is, it is possible to show explicitly that 3'^ = S\~ St- 

Now we use Eq. fl4.666|l to get the components of the generator gi, 

+C.C. + C(e2), (4.689) 

9i = -y2(Hiik+-JiDiA)-P\\^^^^^^ + c.c. + 0{e), (4.690) 
m ^-^ \ c / " Dp 

^ V (ifi.k + -JiDtA) ■ [/^^pIiM + c.c. + 0(e), (4.691) 
VLb ^ \ c / Df 



9i 



and 

9t 



T \ ■ ^0 • AJe + ^{Ji+i - Jt-i)U ■ A 

1 ~ _u 1 exp f^^'^) 

' +C.C. + 0(e), (4.692) 



where we have defined 



Hu ^ -\ {UJ, + '-^J^^ ■ A. (4.693) 

If we had instead used the the first of Eqs. ()4.674p . we would have obtained the 
following results for the components of g'^. 
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+ iP\\ 1 ^ + c.c. + 0{^) 



9? 



Xb^b 
> I t/J^ + 

mc 



D2 



(4.694) 



exp (^^'^) 
•Pii \' ' + c.c. + 0(e), 



and 



51 



J^^:E(^^.-)^^ + c.c. + 0(e), 



Dp 



(4.695) 
(4.696) 



2mc\B 



exp [l^p) 



Dp 



+C.C. + 0(e). 



(4.697) 



By straightforward calculation, it is possible to directly verify that g'l = gi, 
as required by Eq. (|4.674|1 . To do this, simply substitute F = i(kA — Ak) into 
Eqs. ()4.694p through ()4.697p ; upon simplification, the results will be Eqs. ()4.689p 
through ()4.692|) . If we had not made the Lagrangian gauge transformation, 
and had instead started with only and 5*1, we might have had difficulty 
casting Eqs. fj4.689|l through fj4.692|) in the manifestly gauge-invariant form of 
Eqs. piMll through (jil)!F7|l . 

Next we compute the components of uj^ = d^yw Direct calculation gives 



W7, 



i^yji exp ( -^i j + c.c. 



where 



%e ~ ~ 
i^wiRR = —Ji{\^A - Ak), 
ec 

^wiRU = C'(e^), 



B 



ik 



i^wtRe 



ep 



{Ji+i — Ji-i) 1 



A. 



kJi + 



^P2c\ ^ P 
^weuu = C'(e^) 



■JA -A. 



(4.698) 

(4.699) 
(4.700) 
(4.701) 

(4.702) 
(4.703) 
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co^iUf. = 0(e2), (4.704) 
i^u^me = 0(e'), (4.705) 

and 

to^^^,e = Fo- AJi. (4.706) 

If we had instead used u'^ = dY^, we would have obtained the foUowing results: 

= exp (^^^i^ + c.c, (4.707) 



where 



^LiRR = ^^JiF. (4.708) 
c^Lmu = Oie'), (4.709) 
'^Lm^ = -T^F . J+, (4.710) 



B 



^Lme = ^F.Jf, (4.711) 

^Liuu = 0{e'), (4.712) 

^Liu>. = (4.713) 

^Liuo = 0{e^), (4.714) 



and 



^Li^e = -^^0 ^J^- (4.715) 
By direct calculation, it is once again possible to verify that u'^ = dY^ = di j^ + 
dSr) = d'-yw = by simply substituting F = i{kA — Ak) into the results for 
the components of uj'^ and simplifying to get the components of iVyj. 

Before using the above results to calculate K2, we digress for one last discus- 
sion about the relative merits of starting with 'j^ and 7^. First note that all of 
the components of gi and are indeed manifestly gauge invariant. If we had 
started the calculation with 7^, this would not be a surprise since is itself 
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manifestly gauge invariant; if however we had started the calculation with 7^,, 
the manifest gauge invariance of the result would seem fortuitous. In the latter 
event, we would have had results in terms of A, and only through some tedious 
algebraic manipulations would we have discovered that their dependence on A 
arose only through a dependence on F. On the other hand, note that the only 
special functions that appear in the components of gi and Uw are the Bessel 
functions, Ji. The Qi and Ri functions have all disappeared in favor of the Ji. 
If we had started the calculation with 'y^, this would not be a surprise since 
7^t, itself depends only on the J^, and not on the Qi and Ri; if however we had 
started the calculation with 7^, the disappearance of the Qi and Ri functions 
would seem fortuitous. In the latter event, we would have had results in terms of 
the Qe and Ri functions, and only through some tedious algebraic manipulations 
would we have discovered that the recursion relations and derivative formulas 
could be used to cast them in terms of Ji alone. There is thus a peculiar duality 
between the presence of special functions and of manifest gauge invariance. 

We now insert the above formulas into our expression for K2- The averaging 
is carried out as follows: 

exp|^^*^^ exp(^^'ifi'^'^ =5ie'. (4.716) 
We get 



K2 = i^{A* ■PyA+ Y^r^ -UA* ■Fo-A + c.c.) + ^21, (4.717) 



where 



K2e = ^|k.P||.ij,-%-^ 
mcDi I " piiB 



-{Ji-i — Ji+i) 



20 



B 



Jt + T^^o ■ kjJ ■ a\ 



.i^k-P||.k + c.c. + 0(e), (4.718) 
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where c.c. denotes the complex conjugate, and where Hu is defined in 
Eq. ()4.693|1 . If we had instead computed K2 according to Eq. ()4.676|1 , we would 
have obtained the result, 



Once again, by substituting F = i{kA — Ak) into Eq. (|4 .72011 and simplifying, it 
is possible to reduce the expression to Eq. ()4.718jl . thus directly verifying that 
K2 = K2- In the course of this calculation, some of the sum rules of Appendix lEl 
are useful. Henceforth we shall drop the prime in our notation, and refer to 
the ponderomotive Hamiltonian only as K2, whether or not it is in manifestly 
gauge-invariant form. 

Note that K2 is a function of the phase space coordinates, R, [/, ^ and 9; in 
particular, it depends on R through its dependence on the background fields, 
Fo{R) and Fi{R), and through its dependence on the eikonal wave field param- 
eters, F{R) and k{R). Thus we write K2{Z; F{R), k(i?)), where z = 0, 1. 

The ponderomotive Hamiltonian will be used extensively in the next chap- 
ter where we shall study the self-consistent dynamics of magnetized relativistic 
plasma in an eikonal wave field. 




(4.719) 



where 
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4.7 Obtaining the Ponderomotive Hamiltonian 
Using Canonical Lie Transforms 

Grebogi and Littlejohn jS] have obtained the ponderomotive Hamiltonian by 
first performing a single noncanonical coordinate transformation to remove the 
perturbation from the action one form, and then using canonical Lie transforms 
on the Hamiltonian. We shall use that procedure in this section in order to check 
our above result for K2- 

Let us return to the point at which the wave perturbation was first added 
to the single-particle action one form in Eq. fl4.()2()|l . Recall the definition of the 
single-particle velocity u in Eq. ()3.366p . Suppose that we change this definition 
to absorb the wave perturbation; that is, we adopt the following new definition 
for u: 



This has the effect of returning the action one form to the functional form that it 
had before the wave was introduced. Of course, the definitions of the quantities 
that appear in the one form will be different; that is, u and anything that depends 
on u (e.g. k, /3, w, and 9) will be defined differently in terms of the single-particle 
position and velocity. Nevertheless, the action one form is returned to the form 
that it had when no wave was present, and now we can apply the usual guiding- 
center transformation to take it to the guiding-center action one form r^^, given 
implicitly in Eq. ()3.582|1 . with no remaining perturbation due to the wave. 

Whereas the action one form has thus been simplified by this transforma- 
tion, the Hamiltonian, Eq. fl3.367jl . now becomes considerably more complicated. 
Using the new definition of u in Eq. fj3.364|) . we have 




(4.721) 



H'{r, u) = H'^{r, u) + \H[{r, u) + \^H'^{r, u), 



(4.722) 
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where 



H;,{r,u) = -u' (4.723) 
H[{r, u) = --U ■ A{r) exp (r)^ + c.c. (4.724) 



and 



H'Jr, u) = oA{r) ■ A*{r) H nA{r) ■ A{r) exp —if; (r) + c.c. (4.725) 

2mc^ 2mc^ \ e / 

At this point we can apply the guiding-center transformation, (r, u) i— >■ 
(i?, [/, /U, O), which may be taken to be simply R = r — ep to the order to which 
we are working. The result may be Fourier expanded in the gyroangle using the 
usual Bessel function identities. The result is 

H = Ho + XHi + X^H2, (4.726) 

where 



Ho='^U^ + pnB (4.727) 
is the usual guiding-center Hamiltonian (to lowest order), where 

Hi = ^Huexp (-"^ij +C.C. (4.728) 

with Hig given by Eq. fj4.693|) . and where 

e2 . . 

H2 = — ttA ■ A* + osciUatory terms. (4.729) 

To recap, we have applied a noncanonical transformation to remove the per- 
turbation from the Poisson structure and deposit it in the Hamiltonian. We 
can now use a canonical Lie transform to remove Hi (neglecting resonances) 
and average H2 to get K2. Note that this method does not preserve manifest 
gauge invariance; that was lost in the very first step when we redefined w in a 
gauge-dependent way. 
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Applying canonical Lie transform perturbation theory, at first order we have 
from Eq. iHrWi} 

O^Ki^Hi + {Wi,Ho}, (4.730) 



{Wi,Ho} = -Hi = -^ifi.exp Q*,^ +C.C. (4.731) 
Integrate this along unperturbed orbits to get the scalar generator 

W^i =ie^^exp 1^^^'^^ +C.C. (4.732) 

Proceeding to second order, we have from Eq. fj2.252|) 

K2 = H2 + {W2, Ho} + ^{Wi, Hi}. (4.733) 

Now W2 is chosen to average the result, so without having to explicitly calculate 
it, we can write 

K2 = (h2 + . (4.734) 

After a short calculation, this reduces to the result 

,2 1 .12 



i.i--i5:{*„HHL} + c.c. 



2mc2 2^' ' D 

- ■ - - y f-k|| • + + c.c. (4.735) 

That this answer is equal to our previous result for K2 may be proved by ex- 
panding the derivatives in Eq. ()4.735p . replacing I by [D^ — k ■ (t/ + dJd)]/^B-, 
and using the sum rules of Appendix [E] to sum the terms with no resonant 
denominator. The result is Eqs. ()4.717j) and fj4.718jl . 

Note that this is by far the easiest way to get K2- Furthermore, it yields the 
result in a considerably more compact form than the Lagrangian Lie transform 
approach does. On the other hand, as has already been noted, it does not yield 
the result in manifestly gauge invariant form. 
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This result may be compared with that of Grebogi and Littlejohn |Hj who 
used "1 + 3" notation and whose result was gauge invariant but not manifestly 
so. To make this comparison, use the technique for translating our results into 
"1 + 3" notation that was introduced back in Section [3. 131 It is then a straight- 
forward exercise to show that our ponderomotive Hamiltonian gives rise to the 
same equations of motion as that of Littlejohn and Grebogi, though the two are 
not numerically equal. The reason that the two results for K2 are not numer- 
ically equal can be traced back to the fact that the corresponding unperturbed 
Hamiltonians are not numerically equal. This is because Littlejohn and Grebogi 
started with the Hamiltonian (written in terms of three-vector coordinates and 
velocities) , 



which is not numerically equal to the Hamiltonian that we started with, though 
it does yield the same equations of motion. 

It is easier to compare our result with that of Achterberg jlHl who used a 
four-vector approach, but who did not worry about manifest gauge invariance 
and who used essentially the same method outlined in this section. His result is 
identical to our Eq. ()4.735|1 . outside of some minor notational differences. 




(4.736) 



Chapter 5 

The Relativistic 
Guiding-Center Plasma 

5.1 Discussion 

The reason that a Vlasov plasma is a nonhnear medium is that the plasma 
currents generate fields which in turn drive the motion of the plasma. Up until 
now in this thesis, we have dealt only with single particles (or single guiding 
centers or single guiding/oscillation centers) moving in fields that are known in 
advance as fixed functions of spacetime. In this final chapter, we show how to 
pass from this single particle description to a self- consistent description of the 
dynamics of the guiding-center plasma; this includes the dynamics of the fields 
as well as that of the particles. We shall do this by imbedding the single particle 
action in a system action, and coupling it to the Maxwell field. 

In Section 15.21 we prove Liouville's theorem, and show how to write the 
Vlasov equation in any desired coordinate system. In Section 15. 3| we sum the 
guiding-center Lagrangian action over a full distribution of guiding centers and 
couple to the Maxwell field in order to obtain the Lagrangian action of the full 
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guiding-center Vlasov plasma. The variation of this with respect to the guiding- 
center coordinates yields the relativistic kinetic equation for guiding centers, 
while the variation with respect to the four potential yields the self-consistent 
field equation including the guiding-center magnetization and current densities. 

In Section f5. 41 Noether's theorem is applied constructively to obtain covari- 
ant conservation laws for the momentum-energy and the angular momentum of 
a guiding-center plasma. That is, we obtain the stress-energy and angular mo- 
mentum tensors of the guiding-center plasma, including the contribution to the 
angular momentum due to guiding-center spin. 

Finally, in Section 15.51 we employ the results of Chapter to generalize 
the results of Sections 15.31 and 15.41 to the case of a guiding-center plasma in an 
eikonal wave field. We begin by forming a system action, this time including the 
Maxwell action of the eikonal wave field, and the ponderomotive Hamiltonian of 
the guiding/oscillation centers. Variation with respect to the coordinates again 
yields the kinetic equation, which now includes a term due to the ponderomotive 
effects caused by the wave field. Variation with respect to the four potential of 
the background field again yields the self-consistent field equation, which now 
includes a modification in the magnetization density due to the presence of the 
wave. There are then two new additional variations: Variation with respect to 
the eikonal wave field amplitude yields the linear dispersion relation for the wave, 
and variation with respect to the eikonal wave phase yields the conservation law 
for wave action. Constructive application of Noether's theorem to this new 
system action yields the laws of conservation of energy-momentum and angular 
momentum for the combined system of plasma, background field, and wave 
field. Specifically, the modification to the stress-energy and angular momentum 
tensors due to the presence of the wave field is presented and discussed. 



The Relativistic Guiding-Center Plasma 

5.2 Liouville's Theorem 



183 



5.2.1 Lagrangian and Eulerian Descriptions of Relativistic 
Plasma 

In this section, we present a version of Liouville's theorem that is valid for 
relativistic Hamiltonian systems with noncanonical coordinates. We begin by 
examining the difference between the Lagrangian and Eulerian descriptions of 
relativistic kinetic theory. 

Recall that a Lagrangian description keeps track of the trajectory of each 
particle of the system, whereas an Eulerian description uses a distribution func- 
tion to specify the phase-space density of particles (we discussed this briefly in 
Section f2. 3. 3p . Thus, a Lagrangian description for a system of relativistic parti- 
cles might be the specification of z{r], r), where z denotes a set of n-dimensional 
phase space coordinates, 77 is a continuous particle label, and T{r]) is an orbit 
parameter along the world line of the particle with label 77. Specifying z as a 
function of r] and r is equivalent to specifying the phase space orbit of every 
particle in the system. The corresponding Eulerian distribution is 

U{Z) = J dN{r,) J dr{rj)5^{Z-z{v.r)). (5.737) 

Here dN{r]) is some measure describing the number of particles with labels be- 
tween r] and r] + dr]. This measure appears when we pass from the discrete to 
the continuum description; that is 

^ _ I dN{r^). (5.738) 

particles 

In what follows, we shall frequently not bother to write the explicit r] dependence 
of r, but it should be kept in mind that each particle has its own proper time. 

Note that f{Z) has support only on a space of dimension smaller than that 
of the full n-dimensional phase space. This is because there are constraints 
that must be satisfied by the various coordinates involved. For example, single- 
particle dynamics must remain on the mass shell, since u ■ u = —(? . Upon 
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making the guiding-center transformation, this requirement is easily seen to 
become Hgc = (the guiding-center transformation is a difFeomorphism, 

so the mass sheU is distorted but not topologicaUy altered). So, when using the 
{R, K, B, /U, O) coordinates, / has support on a seven dimensional submanifold in 
an eight dimensional phase space. When we use the (i2, U, fi, O) coordinates the 
phase space is ten dimensional, and when we use the (i2, U, fi, a) coordinates the 
phase space is thirteen dimensional; in all cases, however, / has support only on 
a manifold of seven dimensions thanks to the constraints on these coordinates. 

The Lagrangian description keeps track of the dynamics of all the particles in 
the system as though they were distinguishable, and so it includes more degrees 
of freedom than the Eulerian description. That is why it is possible to write the 
Eulerian distribution f{Z) in terms of the Lagrangian description z{r],T), but 
it is impossible to do the reverse. There are many different functional forms for 
z{r], r) that yield the same f{Z). Nevertheless, for a plasma of indistinguishable 
particles (we are not going to bother about species labelling in this thesis) it 
is clear that any physically relevant quantity can be expressed in terms of the 
Eulerian distribution, f{Z). This is because any physically relevant quantity 
should not depend on the identity of the individual particles in the system. 

This is really a gauge invariance issue. The gauge group is the group of 
identical particle interchanges. The Lagrangian description keeps track of extra 
nonphysical gauge degrees of freedom. A physically relevant quantity can be 
written in terms of the Eulerian distribution since it is gauge invariant in this 
regard. 

Consider for example the value of some phase function, $(2), summed over 
all the particles in the system and integrated along world lines 




(5.739) 



This object is invariant under the gauge group of identical particle interchanges 
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because it can be written in terms of the Eulerian distribution as follows: 



(TZ j dN{r]) j dr5"(Z-2(77,r))$(Z) 



d^ZUZ)^{Z). 



(5.740) 



Though we shall frequently work with the Lagrangian description of things, we 
must be able to show that our results can be expressed in terms of the Eulerian 
distribution. Fortunately, this will pose no problem. 

The Lagrangian description of the dynamics of the system is then given by 



i(?^,r) = V{[z\,z{r], r)). 



(5.741) 



where the dot denotes differentiation with respect to r, and where V is the 
dynamical vector field expressed as a function of zij], r) and as a functional of z 
(since the dynamics of one particle may depend on the phase space positions of 
all the other particles in the system). The corresponding Eulerian description 
of the dynamics is then found as follows: 







- j dN{r^) J dr^6^{Z-z{^,T)) 
j dN{ri) j dTz{v,r)-^S^{Z-z{v,r)) 
dN{r]) j drzif], r)5"(Z - ^(77, r)) 
dN{r^) j dTV{[z\,z{Ti,r))5^{Z - z{ti,t)) 
V{[UZ) j dN{7j) J dT5^{Z-z{v,r)) 
■[Vi[UZ)U{Z)]. 



d_ 
dZ 

d_ 
dZ 

d_ 
dZ 
_d_ 
dZ 



(5.742) 



The first line above follows from the fact that at any finite time r is finite, so 
the delta function vanishes at the limits of integration r ±00. Note that we 
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had to assume that the functional dependence of V on 2; could be replaced by a 
functional dependence on /„; this is just a statement of the very reasonable con- 
dition that the dynamics cannot depend on particle labels. The resulting kinetic 
equation for fn{Z) is called the continuity equation, and it expresses conserva- 
tion of particles. It is true for any relativistic system of particles, regardless of 
the nature of the forces involved (they could even be dissipative in nature) . 

5.2.2 Conservation of Phase Space Volume 

One thing that distinguishes Hamiltonian systems from other dynamical systems 
is the property that phase space volume is conserved by a Hamiltonian flow. 
This means that if we take a volume element in phase space and drag each point 
of its boundary surface along a Hamiltonian vector field for some parameter 
increment, the volume enclosed will be unchanged. As we shall now see, this 
property follows from the Jacobi identity; this fact was used in Section [3.101 as an 
argument for using brackets that satisfy the Jacobi identity exactly (as opposed 
to satisfying it only to some order in an expansion parameter). 

Suppose that we have a set of canonical coordinates Zc, and that the Eule- 
rian distribution function in these coordinates is jJyZ^. Now under a (possibly 
noncanonical) coordinate transformation, Z, a, distribution function trans- 

forms in such a way as to keep the number of particles in a fixed phase space 
volume element constant. That is 



where n is the number of dimensions in phase space. Thus, / transforms like a 
pseudoscalar. 



f{z)(rz 



fciZc)rz, 



(5.743) 



f{Z) = UZ,)D, 



(5.744) 



where we have defined the Jacobian of the transformation 



D{Z) 



(5.745) 
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Alternatively, we can define a scalar distribution function, f(^), which trans- 
forms as follows: 

f(Z) = /,(Ze). (5.746) 
It follows that in any coordinate system we have 

f{Z)=f{Z)D{Z). (5.747) 

Note that f{Z) = f{Z) in any canonical coordinate system, since the Jacobian 
of a canonical transformation is unity. In noncanonical coordinates, however, 
f{Z) and f{Z) are different. 

The Lagrangian two-form in coordinate system Z is given by 

where is the canonical Lagrangian two-form. Taking the determinant of both 
sides, we find 

detO = D^. (5.749) 

We now no longer need to make reference to the canonical coordinate system, 
Zc- Eqs. ()5.747j1 and fl5.749j) tell us all we need to know, and they are written 
entirely in the general coordinates, Z. 

Take the gradient of both sides of Eq. (|5.749|1 to get 

2DD,a = (detO),« 

(5.750) 

where we used the formula for the derivative of a determinant, 

(detA),, = {detA){A-y'yA^f3,^. (5.751) 
We are now ready to prove Liouville's theorem. We have 
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(5.752) 



where we used the above formula for DD^q,, and where we used the Jacobi 
identity in the last step. Thus, since D is never zero, we have proved Liouville's 
theorem, 

^ ■ (ZD) = 0. (5.753) 
Now Eq. ()5.742jl may be written for a Hamiltonian system as follows: 

d 

^ ^ . (ZDf). 



(5.754) 



Applying Liouville's theorem, we get the Vlasov equation. 



= ^-^. (5.755) 

Our proof of this result has been quite general, and so in the future we can simply 
write down the Vlasov equation for any Hamiltonian equations of motion. 

The careful reader will have noticed that we assumed invertibility of 
the Poisson tensor in the above proof, whereas our Poisson tensors in the 
(i?, C/, /U, G) and (i?, U, fx, a) coordinate systems are definitely singular. Re- 
call, however, that we showed in Section 13.141 how these constrained coordi- 
nate systems could be imbedded in larger unconstrained coordinate systems. 
That is, we can obtain the (i?, U, /U, O) coordinates by a smooth coordinate 
transformation from the (i?, K, B, /U, G, Ci^, Cib) coordinates, and we can ob- 
tain the (i?, U, /U, a) coordinates by a smooth coordinate transformation from 
the (i?, K, B, fi, O, Cia, Cif,, C2a, C2b, Cs) coordinates. In both cases, the physi- 
cal motion takes place on the subspace for which Cia = Cu = C2a = C2b = 
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and C3 = 1; if the initial conditions are on this subspace, the dynamics wiU 
keep them there. From this point of view, there is nothing singular about the 
transformation that led to these coordinate systems, and the only reason that 
their Poisson tensors are singular is that we enforced the constraints by setting 
Cia = Cib = C2a = = and C3 = 1 at the very end of the calculation that 
led to them. 

Armed with this insight, it is easy to compute the Jacobian D for these 
coordinate systems. First we consider the guiding-center transformation that 
led to the (i?, K, B, fi, O) coordinates from canonical coordinates. The Jacobian 
of this transformation is 



Di = ^/detflgc, (5.756) 

where Qgc is the Lagrangian two-form given in Eqs. ()3. 49011 through fj3.494|) . 
The result is 

TTl 

Di = — KVLbT. (5.757) 

e 

The coordinates (Cia, Cib, C2a, C2b, C'a), which can be thought of as describing 
directions transverse to those described by the (i?, K, B, /U, 0) coordinates, are 
unaffected by the above transformation. 

We now transform to either the (i?, t/, /U, O) system or the (i?, U, fi, a) sys- 
tem. This transformation will involve the coordinates (Cia, Ci^, C2a, (^2;,, C3). 
Its Jacobian is given by 

D = ^(-^^ Q) 
' a(i?, i3, ^, e, Ci., Ci,) ^ • ^ 

or 

^2 = c>(i?, /X, a) ^ 

d{R, K, B, fl, 6, Cia, Cib, C2a, (^26, C3) 

respectively. We can use the transformation equations, Eqs. ()3.562|) or fj3.59H] . 
to calculate the above expressions. The important thing is that we take all of 
the derivatives involved in calculating the Jacobian before enforcing the con- 
straints by setting Cia = Cu = C2a = C2b = and C3 = 1. The calculation is 
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straightforward, and we find that for either the {R, U, fi, 0) or the (i?, U, fx, a) 
coordinates we get 

1 

2K~Xi 



D2 = (5.760) 



The overall Jacobian of the above transformation is thus 

2 

PTtl 

where T' is given by Eq. fj3.575p . Note that this same expression may be used for 
the guiding/osciUation-center problem, since it has exactly the same brackets as 
the guiding-center problem with no wave present. This is because our oscillation- 
center Lie transform took the wave perturbation out of the brackets and put it 
into the Hamiltonian (which is how we got K2)- 

Thus by imbedding our singular coordinate systems in larger nonsingu- 
lar ones, we are able to validate the above derivation of the Vlasov equa- 
tion for our coordinates. Because we had to introduce the coordinates 
(Cia, Cib, C2a, (^2;,, C3), howcvcr, wc should ask what the distribution func- 
tion looks like, and whether or not the kinetic equation that we have started 
with makes sense. Consider Eq. fl5.737jl . written for the coordinate system 
Z = (i?, i3, /U, O, Cia, C16, C2a, (^26, C3). We adopt the shorthand notation 
Z = {Y,C) where Y = {R,K,B,fi,e) and C = (Ci„, Ci,, Csa, C2b, C3). Then 
we have 

hs{Z) = I dN{r]) j drS' [Y - y (77, r)) 5^ {C - c (77, r)) , (5.762) 

where y{rj,T) and c{r],T) give the dynamics of Y and C, respectively. Note, 
however, that since the integral, J dN{r]), includes only particles that obey the 
constraints Cia = Cib = C2a = C2b = and C3 = 1, and since the dynam- 
ics is known to keep such particles on the constraint surface, it must be that 
c(?7, r) = (0, 0, 0, 0, 1). Thus the delta functions involving C can be pulled out of 
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the integral to finally yield 

hs{Z) = 6{Cia)S{Ci,)6{C2a)S{C2b)S{Cs -I) j dN{ri) j drd^ {Y - y (77, r)) . 

(5.763) 

The proportionality of /13 to delta functions in the C is simply a mathematical 
restatement of our earlier observation that it has support only on a space of 
dimension less than that coordinatized by Z. In fact, it has support only on 
a space of seven dimensions (there is another delta function still hiding in the 
integral on the right hand side of the above equation due to the fact that the 
Hamiltonian is a constant of the motion) . The Vlasov equation written in these 
coordinates is then 

= r ■ (5.764) 

where fi3 = /13/-D1 and where the terms C ■ dfis/dC are not present because 
C = 0. We can now integrate the above Vlasov equation over the C coordinates 
to get 

= F.^, (5.765) 

where 

fs{Y) = [ d'ChsiZ) 

dN{rj) [ dTd^{Y-y{Tj,r)), 



(5.766) 

and fg = fs/Di. This is obviously the same Vlasov equation that we would 
have obtained if we had used only the clearly nonpathological {R, K, B, fi, O) 
coordinates from the start. 

It turns out to be easier (for reasons that will become clear shortly) to write 
the Vlasov equation in terms of f and easier to write the field equation in terms 
of /. Since we know what D is, however, there is clearly no problem involved in 
writing both equations in terms of either / or f (recall that / and f are related 
by Eq. (j5. 74711 with D given by Eq. (|5.761|l l. 
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5.3.1 Constructing the System Action 

We begin by considering the case in which there is no eikonal wave field present. 
Our action one-form and Hamiltonian for a single guiding-center are thus given 
by Eqs. (|3.()U4jl and (|3.581|1 . respectively. In Section 15.51 we generalize our 
results to the case in which the plasma is bathed in an eikonal wave field. For 
now we construct the action for the coupled system of guiding-center plasma 
and Maxwell field. This has the form 

S = Sgc + Sm, (5.767) 

where Sgc is the total action of the guiding centers, and where Sm is the action 
of the Maxwell field. 

Now the action of the guiding centers is found by simply summing that for 
a single guiding center over the full distribution. Thus we write 



SgciZ.A,] = jdN{rj) j dT[rg,{Z{r],r);MR{rj,T)),F,{R{rj,r))) ■ Z{rj, 
- J2 UV. t)C, {Ziv, ry,F,{R{ri, r))) 



-if,4Z(r7,r);F,(i?(77,r))) 



(5.768) 



Here we have written Z for the full set of boostgauge and gyrogauge invariant 
guiding-center coordinates, {R,U, fi, a). We have enforced the constraints by 
means of Lagrange multipliers, using X^, to denote the multiplier for constraint 
C^, where the index u runs over all the constraints present as usual. Finally, we 
have indicated separately the functional dependence of the various terms on the 
four potential Ai and the background field Fi (here i denotes the ordering of the 
field as discussed in Section EHj). 



The Relativistic Guiding-Center Plasma 193 

Now Eq. ()5.7(j8j1 may be written in the form 

Sgc = I d^xCgc, (5.769) 



where x denotes spacetime position, and where we have defined the Lagrangian 
density for the guiding centers, 

Cgcix) = J dN{r,) J dTS\x~R{r],T))[Tg,{Z{r],r);A,,F,)-Z{r],r) 

- J2 UV. r)C, {Z{V. r); F,) - Hg,{Z{r], r);F,) . 

(5.770) 

Here we have adopted the convention that Ai and Fi denote Aiix) and Fi{x), 
respectively. 

The MaxweU action is well known to be (see, for example, Jackson |44j ) 

d^xC^ix), (5.771) 



where the Lagrangian density for the Maxwell field is 

C^ = --^{Fo + eF^ + ---): (Fq + eF^ + ■ ■ ■). (5.772) 
In this study, we shall retain terms in Cm only to order e; thus we write 

Cm = -^{Fo : i^o + 2eFo : Fi). (5.773) 

5.3.2 The Vlasov Equation for Guiding Centers 

We first vary the system action with respect to the particle field, Z(r], r). After 
a short calculation, we find 

SS 







5Z(r?,r) 

ng,{Z{r], T);A{R{r], r)), F,(i?(r?, r))) ■ Z(r?, r) 

1/ 

r) J-f 

'^Z{v,ry,F,{R{r^,r))). 



dZ 

(5.774) 
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where O^c = d^gc- This equation, coupled with the constraints 

a(Z(r7,r);F,(i?(77,r))) =5,3 (5.775) 

(which are needed to determine the Lagrange multiphers), shows clearly that the 
fields, Z{r],T), obey the usual equations of motion for a single guiding center. 
Knowing this, and using the ideas developed in the previous section, it is now 
possible to write down the Vlasov equation, 

■ af 

= Z.— , (5.776) 

using the equations of motion for a single guiding center. 

In particular, if we use the (i?, U, fx, O) coordinates, this becomes 

We can now define the guiding- center distribution function, 

f9{R,U,^) = / defio(i2,C/,/x,e). (5.778) 
Jo 

This is nothing more than 27r times the 0-average of the full distribution function 
fio- Now because Z is independent of G (thanks to our guiding-center transfor- 
mation) and because /i = 0, taking the G-average of the above kinetic equation 
yields 

This is the reduced kinetic equation for the guiding-center distribution function. 
5.3.3 The Field Equations 

Generally speaking, the idea is now to vary the above action with respect to the 
four potential to get the dynamical equations for the fields. This must be done 
carefully, however, as there are two additional constraints that such variation 
must respect. Recall that in our derivation of the guiding-center action we 
assumed that the background field scale lengths were large in comparison to 
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the gyroradius, and we assumed that the zero-order fields have A^; = 0. We 
must make certain that the dynamics of the fields do not evolve them into a 
configuration for which either of these assumptions are violated. In order to get 
dynamical equations for the fields that respect these constraints, our variation 
of the action with respect to the four potential must be a constrained variation; 
that is, arbitrary variations of the four potential are not allowed. Only those 
variations of the four potential that preserve the vanishing of A^; to lowest order 
and the smallness of the ratio of gyroradius to scale length are allowed. 

We thus begin our derivation of the field equations by examining the variation 
of the action due to variations of the A^, without assuming in any way that the 
variations of the Ai are arbitrary. Recall that we have indicated separately the 
functional dependence of the various terms in the action on the four potential 
Ai and the background field Fi. Of course, Fi = dAi, so when we vary with 
respect to the Ai we must take into account the Fi dependence. To do this, it 
is convenient to distinguish between total and partial functional derivatives with 
respect to Ai. We use the chain rule to write 




(5.780) 



To proceed, note that 





(5.781) 



so that 



SF,^,{x') _ 
5Aip{x) 



d^p[d^{x - x')] 



V 



(5.782) 



Using this in Eq. fl5.78()j1 . we get 




(5.783) 
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This formula is very useful in what follows. 

Using Eq. (15 .78311 to vary the action with respect to the four potential, we 
arrive straightforwardly at the following result: 

^S = j d^x [Jo [x) ■ Mo (x) + Ji (x) ■ Ml (x)] , (5.784) 



where we have defined 



and 



Jo(x) = -J(x) + -^V-G'o(x) (5.785) 

c 47r 



Ji(x) = -J(x) + -^V-G'i(x), (5.786) 
c All 



where in turn we have defined the guiding- center current density 

J(x) ^ c j dN{^) j dT5\x-R{r^,r))^{Z{r^,r)-A,,F,)-Z{^,r) 
= ^ j dN{rj) J dTd\x-R{7j,T))^{Z{7j,r);A,F,).Z{ri,T) 

= ^ J dR J dU J dn J dOfioiR.U, n,e)6^{x - R)R{R,U, n) 



dR j dU j dufgiR.U, n)5'^{x - R)R{R,U, n), 

(5.787) 



and the macroscopic field tensors 

Go{x) = Fo(x) + eFi(x) - 47rMo(x) (5.788) 

G'i(x) = Fo(x) - 47rMi(x), (5.789) 
and where in turn we have defined the guiding- center magnetization densities 

Mo(x) = 2 j dNiTj) j drd\x-R{rj,T))[-^^iZ{7j,r);A,F,)-Z{Tj,r) 

dFo'"''"'''^'^~ dFo 
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2 I dRjdU JdiJ J defio{R,U,^,e)6\x-R)[-^^{Z;A,,F, 



2 I dR I dU I dnfQ{R,U,fi)5'^{x- R) 

dH. 



dFo 
dV 



'Z;A,,F{)-Z{R,U,^i) 



(5.790) 



2 r r r (9r 

Mi(x) = -JdN{r])JdrS\x-R{r],r))[-^^{Z{r],T);A,F,)-Z{r],r) 



2 dR dU dfi rfe/io(i?,t/,;U,e)r(x-i?) — — (Z;A„Fi) 



■ZiR, U,^)-Y: A.^(^; i^.) - ^{Z; F.) 



2 dR dU dnfg{R,U,n)5''{x- R) -—{Z;A^,F,)-Z{R,U,ij) 



dFi 



J2X.^(Z-F)-^(Z-F 



dFi 



dFi 



(5.791) 



Note that the magnetization came from the second term on the right of 
Eq. ()5.7M3|1 . Also note that the only thing that depends explicitly on Fi is 
the first-order piece of the Hamiltonian, so that only the last term in square 
brackets in the above expression for Mi survives; of course, Fi also appears in 
the brackets due to the Ai dependence of Tgc- Finally note that we were able 
to write the current and the magnetizations in terms of the reduced Eulerian 
distribution function, fg. 

Now because the SAi are not arbitrary, we cannot simply set Jo = J\ = 0. 
Instead, as discussed above, we must restrict the variation so that it respects 
the constraints that A^; = to lowest order and that the ratio of gyroradius 
to scale length is small. To deal with the first of these constraints, let us tem- 
porarily introduce Clebsch variables for the fields. We define four scalar fields, 
a{x), /3{x), k{x), a"(x), such that in terms of these fields the four potential is given 
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by 

Aq = ad(3 (5.792) 

Ai = Kda, (5.793) 
and consequently the field tensor is given by 

Fo = dAo = d{ad(3) =daAd(3 (5.794) 

Fi = dAi = d{Kda) = dn A da. (5.795) 

That such scalar fields exist is guaranteed by the Darboux theorem. That is, 
because F is a closed two-form, it can be written in the form F = da A df3 + 
edn A da., where we are guaranteed enough freedom to choose a and (3 such that 
P|| ■ [da A dp) = 0. 

It is clear that the above construction insures that 

P|| ■ Fo = 0. (5.796) 

Note that we are ignoring Fi for z > 2, and that the parallel electric field must lie 
entirely within Fi. Thus, the specification of the four functions a{x), l3{x), k{x), 
and a{x) is a coordinatization of the function space of all electromagnetic fields 
that automatically ensures the satisfaction of the constraint that A^; = to 
lowest order. 

The variation of the action with respect to the four potentials may now be 
written 

5S = j d^x Jo ■ 5(aV/3) + Ji ■ 5{KVa) 

= j d^x{daJo ■ V/3 + a Jo ■ V5/3 + SkJi ■ W + ^^Jl ■ V5a) 

= I d^x 5a Jo ■ V/9 - 5/3 V ■ (a Jo) + 5k Ji ■ Va - 5aV ■ (kJi) . 

(5.797) 

We still cannot set the coefficients of the variations equal to zero, however, 
because of the remaining constraint that the fields remain sufficiently slowly 
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varying for the guiding-center approximation to remain valid. This point requires 
some discussion. 

Consider a general Fourier decomposition of the electromagnetic field in and 
around a plasma. We can divide the Fourier space into three regions. The first 
consists of slowly varying fields for which the guiding-center approximation is 
clearly valid; we call these background fields. The second consists of rapidly 
varying fields that are due to collective motion of the plasma; we call these wave 
fields, and their effect on a single guiding center was the subject of Chapter HI 
Note that wave fields violate the guiding-center approximation, and the only 
reason that we were able to treat them perturbatively was our assumption that 
their amplitudes are small. The third consists of the extremely rapid fiuctuations 
associated with collisions and higher correlations. 

Now fields belonging to the third region of Fourier space are clearly outside 
of the scope of this thesis; our Vlasov kinetic description of the plasma neglects 
correlations. Wave fields were studied in a single particle context in Chapter HJ 
and their self-consistent evolution will be studied in Section IK. 5L For now we are 
interested in the dynamics of the background fields. We thus define a projection 
operator, "P, that, when applied to an arbitrary field, projects out the part that is 
slowly varying. We shall not be specific about the nature of this operator except 
to say that, since it is a projection operator, we expect it to be idempotent. A 
moment's thought convinces one that this means that it must be a convolution 
of the field with a filter function whose Fourier transform is piecewise constant, 
having a value of either zero or one everywhere in Fourier space. Specifically, 
it has a value of one in the first of the above-described three regions of Fourier 
space, and a value of zero in the other two regions. Exactly how one draws these 
boundaries is what we are leaving unspecified. 

Thus, although we cannot set the coefficients of 5q;(x), 5/3(x), and 
da{x) equal to zero in Eq. fj5.797p . we can enforce the constraint that the fields 



200 



The Relativistic Guiding-Center Plasma 



are slowly varying by requiring that their variations be slowly varying; thus 

6a{x) = V5a{x) (5.798) 

(and similarly for the other three variations). We can also decompose the coef- 
ficients of the variations into slowly varying and rapidly varying parts; thus 

Jo ■ V/3 = V{Jo ■ V/3) + (1 - V){Jo ■ V/3) (5.799) 

(and similarly for the other three coefficients). Thus, upon multiplying Sa{x) 
and Jo ■ V/3, we get the product of the slowly varying terms and a cross term. 
Now the cross term is clearly oscillatory and vanishes upon integration over x. 
It is then legal to set the coefficients of the slowly varying parts of the variations 
equal to zero. This essentially means that we can set the projection of the 
coefficients of the variations in Eq. fl5.797|l equal to zero. 
Thus, we get 

V[Jo • V/9] = (5.800) 
V[V ■ (aJo)] = (5.801) 
V[Ji ■ W] = (5.802) 
^[V ■ (kJi)] = 0. (5.803) 
Now note that from Eq. ()5.787|1 . we have 



e 



dxf^ 

^4 



^R^' 



(5.804) 

where the last step follows from the fact that the delta function vanishes at the 
limits of integration for finite x. This result expresses conservation of particles. 
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or 



From this it follows that 

V ■ Jo = -V ■ = -V • J = 0. (5.805) 

e c 

So our field equations become 

V[Jo ■ V«] = V[Jo ■ V/3] = (5.806) 

V[Ji ■ Vk] =V[Ji- W] = 0. (5.807) 

Thus it follows that 

V[Jo ■ {VaVl3 - V/9Va)] = (5.808) 

V[Ji ■ (VAcVa - V^Vk)] = 0, (5.809) 

V[Fo ■ Jo] = (5.810) 

V[Fi ■ Ji] = 0. (5.811) 

Note that the Clebsch potentials have disappeared from our final result; this was 
essential since they have a gauge freedom and we expect our result to be gauge 
invariant. We simply used the Clebsch potentials to enforce our constraints, and 
then we got rid of them. 

The final results for the field equations are thus 

V[Fo-{^V ■Go + -J)] = (5.812) 
47r c 

P[Fi-(-3-V-Gi + -J)] = 0. (5.813) 
47r c 

Note that the first describes field evolution due to perpendicular four current, 
while the second describes field evolution due to parallel four current. 
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5.3.4 Summary of Self- Consistent Kinetic and Field Equa- 
tions 

To summarize the results of this section, we present the complete set of kinetic 
and field equations for the guiding-center plasma. The kinetic equation is 

0^«.§.^.§. (5.814, 

where R = {R,Hgc} and U = {U^Hgc}, and where in turn the Poisson brack- 
ets are given in Eqs. fj3.569|l through ()3 .57411 and the Hamiltonian is given in 
Eq. ()3.58ip . The field equations are then 

V[Fo-i^V-Go + -J)] = (5.815) 
47r c 

V[Fi-{^V-Gi + -J)] = 0, (5.816) 
where the current is given by 

j(x) = -^jdRjduj dijJg{R,U, n)5'^{x - R)R{R,U, n) (5.817) 

and the macroscopic field tensors are given by 

Gq{x) = Fo{x) + eFi{x) - 47rMo(x) (5.818) 

Gi{x) = Fo{x) - 47rMi(x), (5.819) 
and where in turn the magnetization densities are given by 

M„(x) = 2 j dR j dU j d^J,{R,U,ii)SHx-R)\^(Z;Ai,Fi)-Z{R,U,ii) 
Mi(i) = ijdRjduj dti],{R, U, ,i)S-'(x - R) [^{Z; Ai, Ft) ■ Z(R, U, y.) 

Of course, these must be supplemented by the homogeneous field equations, 

V ■ J^o = (5.822) 



The Relativistic Guiding-Center Plasma 



203 



V ■ -Fi = 0. 

Note that fgiR, U, and f9(-R, U, fi) are related by 

7Q{R,U,t,) = D{R)jQ{R,U,f,), 
where the Jacobian D is given by 

and where in turn T'{R) is given by Eq. (|3.575p . 

5.4 Conservation Laws for the Guiding-Center 
Plasma 

5.4.1 The Noether Method 

We now employ Noether 's theorem to deduce conservation laws for the energy- 
momentum and the angular momentum of the guiding-center plasma. The tech- 
nique has been described by Similon ^21: and we shall compare our results to 
his. We begin by considering the variation in the Lagrangian density due to the 
variation of all the fields. We start with C = Cgc + and apply the variation. 
Whenever terms involving the derivative of a variation appear, we replace them 
by a pure divergence minus a term for which the variation is not differentiated; 
this is almost like integration by parts, but since there is no integral sign, we 
must keep the pure divergence terms. When we are done, we shall find that 6C 
is equal to a pure divergence minus terms, for each field present, that consist of 
the variation of that field times the corresponding equation of motion. Thus, if 
we then use the equations of motion, we can reduce dC to a pure divergence. 
The algebra is tedious but very straightforward, and we get 

5£{x) = V ■ \^SAo-Go + -^SAi-Gi + Joa5p + JiK5a 

+ f dN{7]) f dT6\x-R{r],T))[R{r],r)Tg,{Z{r],ry,A,F,)-6Z{7],T) 



(5.823) 



(5.824) 



(5.825) 
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-5R{rj, r) {Tg,{Z{r), r); A„ F,) ■ Z{r,, r) 
- J2 r)a(Z(^, r); A„ F,) - H{Z{ri, r); F,))] 



(5.826) 



5.4.2 Conservation of Energy-Momentum 



To derive the conservation law for energy-momentum, we consider variations in 
the coordinates that effectively translate in spacetime all the particles of the 
plasma, the fields in the plasma, the external coils that generate the fields, etc. 
Following Similon ^2], we write these as follows: 

5R = ^ (5.827) 

5U = (5.828) 

5^ = (5.829) 

5a = 0, (5.830) 

where ^ is a constant vector. Thus, the particles' position coordinates are pushed 
forward without altering any of their other phase space coordinates. The fields 
translate according to the prescription 

5a = ~^- Va (5.831) 

513 = -C- V/3 (5.832) 
5k = -^- Vk (5.833) 
5a = -^- Vcr, (5.834) 



so 



5Ao = 5{a\/P) 

= 5a\/P + a\/5f3 

= ■ VaV/3 - a^ ■ VV/? 



The Relativistic Guiding-Center Plasma 205 

= ■ (VaV/3 + aVV/3) 
= ■ V(aV/?) 



(5.835) 



and similarly 



SAi = ■ VAi. (5.836) 
Finally note that the Lagrangian densities transform like scalar fields so 

5Cg, = ■ VCg, (5.837) 

5Cm = ■ VCm. (5.838) 
Inserting these into Eq. (I5.826|l . a short manipulation yields 

V ■ T = 0, (5.839) 

where we have introduced the stress-energy tensor 

T{x) = -^Go{x)-Fo{x)-^Giix)-Fi{x) + C^l 
An An 

+ J dN{r]) J dT5^{x - R{'q,T))R{r], T)mU{'q,T) 

= -^Go{x)-Fo{x)-^Gi{x)-Fi{x) + C^l 
An An 

+ JdRjduJdnJ defio{R,U, n,e)5^{x - R)R{mU) 

= -^Go{x)-Fo{x)- ^Gi{x)-Fi{x) + C^l 
An An 

-\- j dRj dU J diifQ{R,U, n)5\x - R)R{mU) 

(5.840) 

Eq. fl5.839|l expresses conservation of energy-momentum in the guiding-center 
plasma. Note that the last form for the stress-energy tensor given in Eq. ()5.840|1 
expresses the result in terms of the reduced Eulerian distribution function, /g. 
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5.4.3 Conservation of Angular Momentum 

To derive the conservation law for angular momentum, we consider variations 
in the coordinates that effectively rotate about the origin of spacetime all the 
particles of the plasma, the fields in the plasma, the external coils that generate 
the fields, etc. Following Similon ^2], we write these as follows: 

5R = n-R (5.841) 

5U = n-U (5.842) 

5^ = (5.843) 

5a = fl-a, (5.844) 

where Q is a constant antisymmetric second rank tensor. Thus, the particles' 
coordinates, i?, U, and d, transform like vectors undergoing an infinitesimal ro- 
tation. The fields rotate according to the prescription 



da — 


-(Q 


■ x) 


■ Va 


(5.845) 


5(3 = 


-(0 


■ x) 


■VP 


(5.846) 


5k = 


-(0 


■ x) 


■ 


(5.847) 


5a = 


-(0- 


x) 


■ Va, 


(5.848) 



so 



Mo = 5{aVP) 

= 5aVP + aV5/3 

= -(O ■ x) ■ VaV/3 + aV[-(0 ■ x) ■ V/9] 

= -(O ■ x) ■ (VaV/3 + aVV/9) + ■ V/3 

= -(O • x) ■ V(q;V/3) + ■ (aV/3) 

= -(O ■ x) ■ V^o + ■ ^0, 



(5.849) 
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Ml = -(O-x) ■ V^i +0- Ai. (5.850) 
Finally note that the Lagrangian densities transform like scalar fields so 

dCgc = -(O ■ x) ■ VCgc (5.851) 

SC„, = -(O • x) • V/:„,. (5.852) 
Inserting these into Eq. fj5.826|l . a short manipulation yields 



T-n-x + j dN{ri) J dT6''{x-R{r],T))R{r],T)Tg,^{Z{r], r); A, F,) ■ Q ■ a 
= 0, 

(5.853) 

where T is the stress-energy tensor given by Eq. fj5.840|l . Since O is the generator 
of an arbitrary rotation, this becomes 

W-{L + S)^ 0. (5.854) 

Here we have defined the third rank orbital angular momentum tensor 

j^ap^ = T"(^x^ - T"'^x^, (5.855) 

and the third rank spin angular momentum tensor 

5"^^ = j dN{r]) j dT5\x- R{r],T)) 

i?"(r7, r)[r^(Z(r7, r); A„ F,)a^ - FT (^(r^, r); A„ F,)a^] 
= J dN{r^) J drS\x - R{r^, r)) ^^^'l"^"!;^^!)) R'^iv, r)Ft{R{ri. r)) 

= j dR j dU j d^i j defio{R,U,iJ.,e)5^{x- R)^R''{R,U,^)fS^ 



dRJdUj diifQ{R,U,iJi)5^{x- R)j-ir{R,U,ij)F^^ . 



(5.856) 
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Eq. fl5.854|l expresses conservation of angular momentum in the guiding-center 
plasma. 

We pause to interpret our result for the guiding-center spin, Eq. ()5.856|1 . In 
a preferred frame, Fq"' = if either /3 = or 7 = 0, so we need consider only 
those components of S"'^'^ for which neither (3 nor 7 is zero, as all the rest vanish. 
Using Eq. ()3.314|1 for Fq in a preferred frame, we quickly find that 

S^^J = j dnj dU j d^Jg{R,U, ^)d^{x - R)e^R"{R,U, ^)e'^^hk, (5.857) 

where Latin indices run from one to three, as usual. Now in three dimensions 
one must take the three-dual of the angular momentum tensor to get the angular 
momentum vector. We can now do this for the last two indices of 5'°*-' . The first 
index is present because the relativistically covariant object is not the angular 
momentum itself, but rather its four flux. Taking the three dual, we flnd 

^ekijS'''^ = e j dR j dU j d^JgiR, U, ij)5^{x - R)ir{R, U, ij)ijhk. (5.858) 

Thus, to lowest order in e, when a = we get c times the spin density, which 
is the sum over the distribution of guiding centers of the vector with magnitude 
7„/U that points in the direction of b. Thus the spin angular momentum for a 
single guiding center in a preferred frame may be thought of as having magnitude 
7„/U and pointing in the direction of the magnetic fleld. For a = / 7^ 0, it is clear 
that we get the flux of this quantity, as the integrand has an additonal factor of 
f|l (to lowest order). This makes plausible our interpretation of S as the spin. 
Note that 

= T"^ — T"'*' x^ + T'^^ — T^^ 
— j'lP _ 

(5.859) 

where we have used Eq. fj5.839|) . Using this result, we can write the angular 
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momentum conservation law in the following form: 

T - T'^ + V ■ ^ = 0, (5.860) 

where the superscripted T means "transpose." Note that the antisymmetric part 
of the stress-energy tensor is equal to the divergence of the spin tensor. 

5.5 The Guiding-center Plasma in the Presence 
of an Eikonal Wave Field 

5.5.1 Constructing the System Action 

We are now ready to extend the above analysis to the situation for which the 
plasma is bathed in an eikonal wave field. The full four potential is now 

A{x) = Aq{x) + eAi{x) + AA^(x), (5.861) 

where the eikonal wave four potential 

) exp ^-'^(a;)^ + c.c. (5.862) 

was introduced back in Eq. ()4.618|1 of Section 14. 2L The corresponding field is 
then 

F{x) = Fo{x) + eFi(x) + AF^(x), (5.863) 



F^{x) = -F{x)exp ( -V'(x) ) +C.C. (5.864) 



where 



e \ e 

and 

F(x) =z(ki-ik) + e(vi-iv) (5.865) 

(the 0{e) term in F is usually neglected in the eikonal approximation). Note 
that Fq and Fi are slowly varying background fields, while F^ is the rapidly 
varying wave field. We must now construct the system action for a plasma of 
guiding/oscillation centers immersed in this field. The presence of the wave field 
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has two effects on the system action: It means that the Hamiltonian must now 
include the ponderomotive contribution, K2, and it means that the MaxweU 
action must now include the wave field. 

We first consider the effect on the Maxwell action. We form —F : F/IGtt, 
and note that it contains the product of the slowly varying terms, the product 
of the rapidly varying terms, and cross terms. The cross terms are oscillatory 
and vanish upon integration over x. The remaining Maxwell action is then 

Sm = {Sm)o + ^'^Sm-i (5.866) 

where {Sm)o is the functional form of the Maxwell action with no wave present 
(given by Eqs. ^^TTTllf and dHUZl), and 

is the contribution due to the wave. Thus the effective (averaged) Lagrangian 
density is 

C-m = {C-m)0 + >?C-m. (5.868) 

where {C.m)o is the functional form of the Lagrangian density with no wave 
present (given by Eq. ()5.773|] ). and 

Cm = -^F" : F (5.869) 

OTT 

is the contribution due to the wave. Note that Cm is quadratic in the field 
amplitude. 

We now consider the modification of the action due to the presence of the 
ponderomotive Hamiltonian. Replacing H hy H + X^K2 in Eq. ()5.768j1 . we see 
that 

Sgc = {Sgc)o + A'^c, (5.870) 

where {Sgc)o is the functional form of the guiding-center action with no wave 
present, and 

Sgc = - j d^x j dN{r)) J drS^ix - R{r],T))K2 (5.871) 
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is the contribution due to the wave. Also note that the Lagrange multiphers are 
altered by the introduction of K2 (recall that the Lagrange multipliers depend 
on the Hamiltonian). Thus X^, = {Xu)q + X^, where 

A. = 6-^, (5.872) 

and where the vectors $,1, were given in Eq. (|3.613|1 at the end of Chapter El 

Now K2 can be expressed as a real function of the wave field amplitude, F, 
thanks to its manifest gauge invariance. Specifically, examination of Eq. ()4.719j) 
shows that it is a real quadratic form in the wave field amplitude. Thus it can 
be written 

K2{Z; F„ F, k) = ^F:^/C«^^^(Z; Fo, k)F^„ (5.873) 

where the antisymmetry of the field tensor imparts the following symmetry prop- 
erties to /C: 

jQa/S^T] _ _i^l3ot^rj _ jQl3otrj£, _ _jQal3ri^ 874) 

and the reality of K2 implies 

/C"^^^ = (/C^^"^)*. (5.875) 

It is clear that a kernel, /C, with the above properties is defined implicitly by 
Eq. ()4.719p . Thus we can write 

Sgc = -l J d^x J dN{r^) j dr5^x-i?(77,r))F:^/C"^«''(Z;F„k)%. (5.876) 

If we now define the fourth rank generalized susceptibility tensor 

X"^«n^,[^,^^,k]) = j dN{r^) j rfr54(x-i?(r7,r))/C"^«''(Z;F„k), (5.877) 

(note that this differs from the more conventional definition of susceptibility by 
a minus sign) then we can put this in still more compact form, 

~Sgc = -\j d^^F* : x(x, [Z,F„k]) : F. (5.878) 
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Alternatively, we could write K2 as a quadratic form in the wave potential 
amplitude. Using F = i(kA — Ak), we find 

K2{Z;F,,A,k) = 2i:/C"^i5, (5.879) 

where the kernel 

/C"^ = k/3k^/C°^^^ (5.880) 

is a second rank tensor. Note that we denote it by the same symbol (/C) that we 
use for the fourth rank kernel; which is meant should be clear from either the 
context or the number of indices adorning it. The guiding- center action is then 

Sg^ = -2 j d^x j dN{ri) j dr5^{x - R{r],T))AllC"^{Z-Fi,k)A^. (5.881) 

We can then define the second rank susceptibility tensor 

x"«(a;,[Z,F„k]) ^ 2 j dN{rj) J drS^x - R{r^,T))lC^^{Z- F,,k) 
- 2k^k,x"^^^ 

(5.882) 

so that we may write 

Sg^ = - d^xA* ■ X ■ i. (5.883) 



Once again note that we have used the same symbol to denote the fourth order 
and second order versions of the susceptibility. 

The guiding-center Lagrangian density is then clearly 

Cgc{x) = {Cgc)o{x) + X'Cgcix), (5.884) 

where {Cgc)Q{x) is the functional form of the Lagrangian density when no wave 
is present, and 

Cgcix) = -^F* : x{x, [Z,F„k]) : F (5.885) 
is the contribution due to the wave. 
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The total action is thus 

S = {S)o + X^S, (5.886) 

where 

{S)o = {S^)o + {Sgc)o (5.887) 

and 

S = S^ + Sg, = -^ J d'^xF* : e{x, [Z, F„ k]) : F, (5.888) 
and where in turn we have defined the fourth rank generalized dielectric tensor 

e-\^^5\5^^ + Anx'"',^. (5.889) 

Alternatively, in terms of the wave potential amplitude, we have 

S^-^ [ d^xA* ■ V{x, [Z, F„ k]) ■ i, (5.890) 
47r J 

where we have defined the second rank dispersion tensor 

= k^^"^ - k"k^ + 47rx"5. (5.891) 
Similarly, the total Lagrangian density is thus 



where 



and 



C = (£)o + A^Z, (5.892) 



(£)o = {Cm)o + (£30)0 (5.893) 



_J_F* :£(x, [Z,F„k]) :F 
87r 

-l-A*:V{x, [Z,F„k]):i 



(5.894) 



The above action must be varied with respect to the particle coordinates and 
the fields as before, but now we must also vary it with respect to the wave fields. 
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dS 2i, fdS\ - fdS\ 

= ^ -2V-^ 5.895) 

total SA e UfJ Uf' 



A(x) and 'i/'(x). Note that the action depends on A only through its dependence 

on F, thanks to the manifest gauge invariance of K2; variation with respect to 

A win yield the dispersion relation for linear plasma waves. Note also that the 

action depends on ip only through its dependence on k = VfA, thanks to the 

averaging out of oscillating terms; thus '0 is an ignorable field coordinate, and 

variation with respect to it will yield the conservation law for wave action. 

Just as we found it useful to denote the dependence of a functional on Ai 

and Fi separately, we shall also find it useful to denote dependence on A and F 

separately. Using Eq. fj5.865|) . the analog of Eq. ()5.783|1 is easily found to be 

SS 
JI 

(in the eikonal approximation, the third term on the right hand side is usually 
neglected) . Similarly, we shall also find it useful to denote dependence on if; and 
F separately (note that F contains k which is the gradient of "0). Once again, 
we use Eq. (|5.865|1 to write 

6S_ 

# total e ' \" SF 

These results are very helpful in deriving what follows. 

5.5.2 The Vlasov Equation for Guiding/Oscillation Cen- 
ters 

It is straightforward to see that 



^-^V-fi*-^'). (5.896) 



6S _ (_SS_\ 



-A^ ^^'-%+^^^ Z(,,r);F.(i^(,,r)), 
F(i?(r7,r)),k(i?(77,r))), 

(5.897) 

where, as usual, we have used a subscripted to denote the functional form of 
a quantity when no wave is present. The above result yields the correction in 
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the equations of motion due to the presence of the ponderomotive Hamiltonian. 
Thus, the only modification to the kinetic equation due to the wave field is the 
inclusion of the ponderomotive effects of the wave field on the guiding/oscillation 
centers of the plasma. 

5.5.3 The Field Equations 

Next, we use Eq. fj5.783|l to take the functional derivative of S with respect to 
the Ai to get 

dS ( 6S 



5Ai \SAiy Q 



+ 2A^V ■ 



dN{ri) J dT5^{x-R{ri, r)) 



l^, :9/C(Zfar),F„k): ^ 
2 dR 



(5.898) 



Thus our field equation still follows from 

J d^xiJo ■ 5Ao + Ji ■ 5Ai) = 0, (5.899) 



but now: 



Jr = {Ji)o + 2A2v ■ [ j dN{7]) J dT6\x - R{ri, r)) 
l^, :9/C(Z(r7,r),F„k): ^i 

2 dF, y 

(5.900) 

Note that /C has no explicit dependence on Fi (the only effect of Fi is to alter 
the Poisson brackets), so only j7o is modified. This may be interpreted as a 
modification to the guiding-center magnetization density due to the presence of 
the wave field. That is, our field equations are still given by Eqs. fj5.812|) and 
dSSinD, but now 

M = {M)o-2X^ J dN{r)) J drS'^ix- R{r],T)) 



l-, :a/C(Z(77,r),F„k): - 
2 dF 



[5.901] 
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Note that the guiding-center current density is unaffected by the presence of the 
wave; this is due to our neglect of resonant effects. 

5.5.4 The Linear Susceptibility 

We now have two additional equations of motion due to the variations with 
respect to A and '0. First we consider the variation with respect to A. We use 
Eq. fl5.895jl . and in keeping with the eikonal approximation, we neglect the third 
term on the right. We immediately get 

= ^ = P (^k ■ e{x, [Z, F„ kD-.py (5.902) 

so 

V (k ■ £(x, [Z, Fi, VV^(x)]) : F{x)^ = 0. (5.903) 

This is the eikonal equation for linear plasma waves. To see it in a somewhat 
more familiar form, write F = i(hA — Ak), so after some straightforward ma- 
nipulation we arrive at 

V{V • i) = 0, (5.904) 
where we have used the dispersion tensor defined back in Eq. fj5.891|l , 

= k^^^. - k^kg + 87rk„k^x"^ 



k^^^ - k^k^ + 47rx^ 



(5.905) 



The dispersion relation for linear plasma waves is found by setting the eigen- 
values of the dispersion tensor equal to zero. In "three-plus-one" notation, the 
dispersion tensor is three by three and so it has only three eigenvalues that can 
be set to zero. It seems that we are finding an extra branch to the dispersion re- 
lation, and one might wonder why this should be so. By multiplying Eq. ()5.905p 
by k^, however, it is easy to see that k is a null eigenvector of V. Thus, the 



The Relativistic Guiding-Center Plasma 



217 



extra eigenvalue is null, so setting it equal to zero does not yield any new in- 
formation. The other three roots yield the more interesting information about 
plasma waves. 

5.5.5 Conservation of Wave Action 

We next consider the equation of motion obtained by varying -0. Using 
Eq. ()5.896|1 . we immediately find 

= — = V ■ J, (5.906) 

dtp 

where we have defined the wave action four flux 

J = vi -^A* ■e:F + ^F*--^F . (5.907) 
\27re Stt ok j 

Our equation of motion thus expresses the conservation of this wave action. 

The wave action takes on a much simpler form when written in terms of the 
dispersion tensor, defined in Eq. fj5.905|l . We find 

J = vi^A*-^A]. (5.908) 



V87r dk 

Finally note that the wave action is gauge invariant, although this is not 
manifest in either of the two forms presented above. To prove this, we replace 
A* by i* - zkA* in Eq. (|5.9()7|1 . Using the dispersion relation, Eq. (|5.9()3|1 . it is 
easy to see that the term involving A vanishes, leaving J' unchanged. 

5.5.6 Applying the Noether Method 

We now consider what happens to the conservation laws obtained by the Noether 
method when we include the effects of the wave field. In this case, Eq. ()5.82 
is altered in the following way: 

SC = {SC)o - V ■ {Si: J) - ^ V ■ [{e : F) ■ A*] + A^v ■ (M ■ SAo) 
+V ■ ly dN{7]) J dT5R{r), t)5^{x - R{r), r)) 
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yF*:/C(Z(r7,r);F„k):F}. 

(5.909) 

To derive this equation, we applied the variation to the fuU Lagrangian density 
for the guiding/osciUation-center plasma in the presence of the wave field. We 
noted that 

F = -kA + - (transpose), (5.910) 
e 

so 

SF = -kSA + -(V#)i + {VSA) - (transpose). (5.911) 

e e 

Finally, we used the equations of motion to simplify the result, just as we did 
for the case in which there was no wave field present. 

Note that the second term on the right hand side of Eq. fj5.910|l and the 
third term on the right of Eq. 1)5.91111 are usually neglected in the eikonal ap- 
proximation. They are similar in this respect to the third term on the right of 
Eq. (|5.895|1 . and the 0{e) terms of Eq. ()5.865|1 (which also must be included in 
the analysis leading to Eq. (|5.909jl ). Up until now, we have consistently neglected 
these terms in our analysis. It will turn out that they are also unneccessary in 
deriving the conservation law for energy-momentum, but they are necessary in 
the derivation of the conservation law for angular momentum in order to obtain 
the correct expression for the modification of the guiding-center spin due to the 
presence of the wave. 

5.5.7 Conservation of Energy-Momentum 

We now use the same translational variation of the system that we did in the 
case for which no wave was present, but now we add the variations of the wave 
quantities, 

5tp = -^-^ij = -^-k (5.912) 

and 

SA = -^- vi- (5.913) 
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There are five terms on the right hand side of Eq. fl5 .90911 . The fifth term 
cancels the portion of SCgc = ■ WCgc (on the left hand side) that is due to K2- 
The fourth term is the correction to the magnetization density due to the wave, 
as defined in Eq. (|5.79(Jjl . It will simply cause the magnetization density that 
appears in the conservation laws to be corrected for the presence of the wave. 
The third term is of the sort discussed above that may be neglected in the usual 
eikonal approximation. The new stuff comes from the second term, V ■ (i/k ■ ^), 
and from the portion of 6 Cm — ■ S/Cm (on the left hand side) that is due to 
the wave. 

The new stress-energy tensor is then 



where (T)o is the result with no wave field present (see Eq. ()5.84Up ). and T is 
the modification due to the wave. 



To recap, the first term on the right hand side above simply insures that the 
magnetization that appears in the stress-energy tensor is that corrected for the 
presence of the wave. The second term on the right hand side above similarly 
insures that the term Cm'^ that appears in the stress-energy tensor is also cor- 
rected for the presence of the wave. The third term is the stress-energy due to 
the wave itself. Note that it is the tensor product of the wave action with the 
four wavevector. This is sensible since the wave action may be interpreted as 
the number flux of wave quanta times some unit of action, and the unit of action 
times the four wavevector is the energy-momentum per quantum. 

5.5.8 Conservation of Angular Momentum 

Finally, we examine the law of conservation of angular momentum. We use the 
same rotational variation of the system that we did in the case for which no 



T = (T)o + A^T, 



(5.914) 



T = M-Fo + Cml + Jk. 



(5.915) 
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wave was present, but now we add the variations of the wave quantities, 

5tp = -{n-x)-Vip = -{^-x)-k (5.916) 

and 

6A = -{n- x) -VA + n- A. (5.917) 

Once again, we examine the five terms on the right hand side of Eq. fl5.9()9p . 
Now 5C = —{Q- x) ■ \/C = —V ■ [(^ ■ x)C], so once again the fifth term will cancel 
with the portion of SCgc (on the left hand side) that is due to K2. Similarly, it is 
straightforwardly shown that the fourth term causes the magnetization density 
that appears in the angular momentum tensor to be corrected for the presence 
of the wave, just as it did in the stress-energy tensor. The second term is 
V ■ [i7k ■ O ■ x] , and this contributes a new term in the orbital angular momentum 
tensor; so 

L = {L)o + X'^L, (5.918) 

where 

= f^P^i _ T"72./3_ (5,919) 

Clearly, this is the orbital angular momentum due to the wave. 

This time we retain the third term on the right hand side of Eq. fj5.909p . It 

is 

— V ■ \ [-(O ■ x) ■ vi* + O ■ i*] ■ £ : F i . (5.920) 
27r I J 

We shall still ignore the first term in square brackets, as it contains a gradi- 
ent of the wave field amplitude, but we retain the second term. After some 
manipulation, it becomes 

^[0;3,(i-*£^VF^- - A^^e-'^^F'^nU. (5.921) 

From this we can identify a correction to the spin angular momentum tensor. 
We write 

S={S)o + X^S, (5.922) 
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where 

= ^7*/"^^^^*^ _ A^*e^"^^F^''. (5.923) 

This is the correction to the spin angular momentum tensor of a 
guiding/osciUation-center plasma due to the presence of an eikonal wave field. 
This quantity is given by Soper [301 for oscillations in an electromagnetic field 
in a vacuum. He writes 

S^fjf = i7*^/3a _ i/3*p'7«_ (5 924) 

(see his Equation (9.3.14)). If we set the susceptibility in Eq. ()5.889p equal to 
zero, and plug the resulting vacuum dielectric into Eq. (|5.923|1 . it is clear that 
our result will reduce to Soper's. Thus, our result may be considered to be an 
extension of his result to the case of dielectric media. 

The lack of gauge invariance of our result for S is disturbing and will be 
discussed further in Chapter IHl 
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Chapter 6 

Questions for Future Study 



In this chapter, we discuss some questions raised by this study that could be 
topics for future research. These are in no particular order. 

• The neglect of resonant effects is probably the most glaring omission of 
this thesis, and probably that most likely to limit its utility. There are 
several schools of thought on how to deal with resonant effects, but they 
break down into two major categories: 

First, there are attempts to simply "patch up" the nonresonant treatment: 
For example, since our nonresonant treatment has successfully given us 
the hermitian part of the susceptibility tensor, we could use the Kramers- 
Kronig relations to get the antihermitian part. Alternatively, we could 
simply dictate that all resonant denominators are to be treated according 
to the Landau prescription. These methods, while successful in describing 
resonant particle effects on plasma waves, fall far short of a unified descrip- 
tion of the effects of resonant particles. Furthermore, there is something 
aesthetically displeasing about tricks of this sort. 

Second, there are attempts to go back and redo the single particle analyses 
to include resonant effects. The general idea is that we first went astray 
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when we said that we could transform away the first order part of the 
action due to the eikonal wave field. While we can certainly do this far 
away from the resonant regions of phase space, we certainly cannot do this 
at (or even near) the resonance itself. So we should go back and retain 
the first order part of the action in the region of phase space near the 
resonance. Like the first technique, this approach explains certain things 
nicely, but falls short of a unified description of resonant particles. For 
example, the first order action that we retain will depend on the four 
potential of the wave, and this will yield a modification to the current 
density of a guiding-center plasma that is immersed in a wave field; this 
is the current drive due to a wave field that tokamak researchers study. 
On the other hand, a good description of how this residual piece of the 
first order action gives rise to Landau damping does not seem to exist. 
Furthermore, there is a great deal of arbitrariness connected with how to 
decide just how much of this first order action to keep. One approach 
uses "window functions" of some characteristic width, but there is a great 
deal of freedom in just how these window functions should look (square 
windows, gaussian windows, etc.); Dewar j3T] gives a variational principle 
for determining optimal window shape, but then we have to worry about 
just what we mean by "optimal." There is also a great deal of freedom 
in choosing the width of such windows. If we try to transform away the 
first order action too close to the resonance, problems develop due to the 
presence of the trapped particles, and the transformation ceases to be a 
near-identity diffeomorphism. Unfortunately, it is hard to quantify what 
we mean by "too close" in this regard. Perhaps the window width should 
itself be treated as a dynamical variable whose dynamics are given by some 
variational principle (like that of Dewar) ; this might be a useful tool for the 
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study of "resonance broadening" effects, where the width of the resonant 
region varies in time. 

• Pursuing the osciUation-center Lie transforms to higher order is a natural 
and obvious extension of this thesis. In this way, one could study induced 
scattering and three-wave phenomena. Past attempts to study these have 
either not used systematic perturbation theory (e.g. Lie transforms), or 
have used Hamiltonian methods without manifest gauge invariance. This 
thesis should provide the tools needed to combine the desiderata of sys- 
tematic perturbation theory and manifest gauge invariance. Central to 
this effort has been the use of the homotopy formula, and the introduction 
of the pair of special functions, Qe and Re. 

It is interesting to note that this same program could have been carried out 
for the nonrelativistic problem. One must simply take the perturbation to 
the action due to the wave (for which there now would be both a vector 
and a scalar potential), and apply to it the guiding-center Lie transform, 
using the homotopy formula in the same way that we did here. 

• The inclusion of dissipative effects (collisions, correlations, etc.) would 
be an important generalization of the work presented here. This is un- 
doubtedly related to the problems associated with the inclusion of resonant 
effects. A unified treatment of correlations would yield the appropriate col- 
lision operator in the kinetic equation, and modify the energy-momentum 
conservation law to describe the flow of energy into heat. 

One way to approach this subject might be through the extended use of 
projection operators. We employed this technique in Chapter El to show 
that it was possible for energy-momentum and angular momentum to flow 
from one relevent region of Fourier space to another irrelevent one, and 
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thereby to effectively appear as a source term in the conservation laws. 
We did not pursue this idea of partitioning Fourier space into one zone 
for background fields, one zone for wave fields, and one zone for effects of 
collisions (for example, we never introduced a second projection operator 
for the wave fields, or a third one for fields arising in collisions). This 
approach may prove useful, but it quickly leads to great complication in the 
procedure, and it is not clear how it might give rise to collision operators, 
etc. 

• When we applied the Noether method to the action to obtain the guiding- 
center spin angular momentum, we used the version of the action that was 
both boostgauge and gyrogauge invariant. There is a good reason why we 
did this. Other versions contain the quantity 71 that was introduced back 
in Chapter |S1 If we had tried to apply Noether 's theorem to an action 
containing 7^, we would at some point have been faced with the question 
of how to vary TZ with respect to the four potential. It seems that TZ is 
not independent of the four potential since it was defined in terms of the 
unit vectors, e^, and these, in turn, depend upon the background field. 

We dodged the issue by going to the boostgauge and gyrogauge invariant 
coordinates for which TZ does not appear in the action, but it is inter- 
esting to contemplate the alternatives. If we were to simply ignore this 
term, we would not get guiding-center spin, and that would be unaccept- 
able. Though we had to go to higher order to find this term in our first 
derivation of the guiding- center action, it has the same order as the iJ,d9 
term which is obviously critically important. Indeed, now that we have the 
benefit of hindsight, we see that we could have avoided the higher order 
guiding-center Lie transform altogether by examining the action at clas- 
sical order and asking what we would have to add to it to make the fidO 
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term gyrogauge invariant. The answer would have been —{iTZ ■ dR, and 
this was really the only important term we found at higher order. Thus, 
the clever application of a gauge invariance requirement can save one from 
going to higher order in a perturbation calculation! 

So, since we can't ignore this term, how else could we have dealt with it? 
There are a couple of possible avenues of approach. First, recall the well 
known result that the stress-energy tensor is given by the derivative of the 
Lagrangian density with respect to the metric tensor (this is true at least 
for spinless systems). There seems to be an analogous theorem (or, at least, 
a conjecture) enunciated by Hehl [32], that the spin angular momentum 
tensor is the derivative of the Lagrangian density with respect to torsion. 
Torsion is the result of an asymmetric afhne connection, and the afhne 
connection that we had to introduce in Section 13.111 to explain the TZ ■ R 
term in G is indeed asymmetric. Now it is not clear to me that 7^ is a 
torsion, but these remarks do make it clear that TZ has at least something to 
do with torsion. In any event, TZ appears in our guiding-center action with 
a /U in front of it, so it is possible that we could apply the above theorem 
(conjecture?) and derive guiding-center spin directly (without recourse to 
Noether's theorem). I suspect that, if this were possible, it would be of 
more interest to researchers in quantum gravity (which is the community to 
whom reference jH^] was aimed) than it would be to researchers in plasma 
physics. It may be that guiding-center motion provides a unique classical 
forum within which this topic of current research in the field of quantum 
gravity may be applied, tested, and better understood. 

Another possible approach to the spin problem is yet more speculative. 
It is suggested by the minimal coupling idea of gauge field theory. Recall 
that 71 is the gauge potential associated with the gyrogauge group. In 
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Section nil H we even went one step further and derived the corresponding 
gauge field, Af. Using the techniques of gauge field theory, it might be 
possible to use TZ to define a gauge covariant derivative. We could then 
add something like A/" : A/" to the Lagrangian density, and treat A and 
TZ as independent gauge fields. Though these ideas are suggested by the 
analogy with gauge field theories, they would all have to be rigorously 
justified. Furthermore, it is not obvious how guiding-center spin would 
arise from these considerations. 

• Another mystery that should be mentioned is the apparant lack of gauge 
invariance of the wave modification to guiding-center spin. Our result 
is clearly the extension to dielectric media of Soper's result for the vac- 
uum inO! • The lack of gauge invariance did not seem to bother him, except 
for a cryptic footnote that indicates that the result is invariant with respect 
to a certain subgroup of the full gauge group. One possible explanation 
might be that the division of angular momentum into orbital and spin con- 
tributions is not a gauge-invariant division. If this were the case, however, 
one would expect that neither the orbital nor the spin angular momentum 
should be gauge invariant by itself, but that their sum should be gauge in- 
variant. Alas, the orbital angular momentum seems to be gauge invariant 
all by itself, so the issue remains a mystery. 

• It would be nice to find a Hamiltonian field theoretical formula- 
tion of the kinetic and field equations for the guiding-center and the 
guiding/oscillation-center plasma. Manifestly covariant Hamiltonian field 
theories are, however, tricky to formulate. We cannot give preference to 
the time variable, and the proper time is not uniquely defined (every par- 
ticle in the system has its own proper time) . There may be ways of getting 
around this difficulty by generalizing the form of Hamiltonian equations 
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of motion for such systems. If this could be done, it might be possible 
to use the energy-casimir method to study plasma stability to nonlinear 
perturbations. 

• We have developed conservation laws for energy-momentum and angular 
momentum for the guiding/oscillation-center plasma. In most studies of 
plasma dynamics, use is made of energy conservation, but not of momen- 
tum or angular momentum conservation (of course, in a covariant rela- 
tivistic treatment energy and momentum are inseparable). It is possible 
that these conserved quantities could play a far greater role in the study 
of, say, plasma stability theory than they have until now. For example, the 
Lyapunov method for assessing stability rests heavily on the discovery of 
conserved quantities. Just how to go about doing this is not immediately 
clear. 
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Appendix A 



Glossary of Notation 



In this appendix, we list all the important symbols used in this thesis, giving the 
number of the equation where they were first used (if appropriate) and a brief 
description (if appropriate). 



SYMBOL EQUATION DESCRIPTION 



a{x) (j5.794|) Clebsch potential for field 

a ()3.59ip Gyrogauge-invariant coordinatization of gyroan- 

gle 

/? ()3.34ip Angular hyperbolic polar coordinate for parallel 

part of particle four velocity 
/3{x) (|5.794|) Clebsch potential for field 

I3e ffT^THnj) E X B/S2 

/3, fnOK7l> v/c 

Tgc (j3.489p Guiding-center action one form 

7 Action one-form 

7„ Relativistic gamma factor: 7„ = 1 / a/ 1 — . 

S°'a Kronecker delta 
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e Guiding-center expansion parameter 

e'^^'"'^^ Levi-Civita tensor in n dimensions 

e"^^^ ()5 .88911 Generalized dielectric tensor 

T] (|5 .73711 Continuous particle label 

B ()3.485p Angular polar coordinate for perpendicular part 

of guiding-center four velocity 

6 (|3.341jl Angular polar coordinate for perpendicular part 

of particle four velocity 

k{x) (|5.795jl Clebsch potential for field 

A ()4.62U|1 Oscillation-center expansion parameter 

Ai ()3.287|1 Lorentz scalar for electromagnetic field 

A2 ()3.288j1 Lorentz pseudoscalar for electromagnetic field 

Ay ()3.6U4|1 Lagrange multiplier 

Xb ()3.293|1 Related to eigenvalues of F 

Xe ()3.292jl Related to eigenvalues of F 

jji ()3.488|1 Gyromomentum 

V Constraint label 

^ ()5.827|1 Generator of infinitesimal translation in space- 
time 

e mm 

cr(x) ()5.795|1 Clebsch potential for field 

T Proper time 

x"^^^ ()5.877jl Generalized susceptibility tensor 

if) ()4.618p Phase of eikonal wave 
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Generator of infinitesimal rotation in spacetime 

Gyrofrequency with respect to proper time 

Guiding-center Lagrangian two form 

Lagrangian two-form 

Four-vector potential 

Zero-order four-vector potential 

First-order four- vector potential 

Eikonal wave potential 

Amplitude of eikonal wave potential 

Three-vector potential 

Member of orthonormal basis tetrad 

Magnetic field pseudovector 

Angular hyperbolic polar coordinate for parallel 

part of guiding-center four velocity 

Unit three-vector in direction of magnetic field 

Member of orthonormal basis tetrad 

Constraints 

Speed of light 

Member of orthonormal basis tetrad 
Jacobian 

Dispersion Tensor 
Resonant denominator 
Charge 

Electric field vector 
Field tensor 



Zero-order field tensor 



Glossary of Notation 



First-order field tensor 
Eikonal wave field 
Amplitude of eikonal wave field 
Dual field tensor 

Pseudoscalar Eulerian particle distribution func- 



Scalar Eulerian particle distribution function 
Pseudoscalar Eulerian guiding-center distribu- 
tion function 

Scalar Eulerian guiding-center distribution func- 
tion 

Metric tensor 

Macroscopic field tensor for perpendicular cur- 



Macroscopic field tensor for parallel current 
Hamiltonian 

Guiding-center Hamiltonian 





Interior product with respect to vector field g 
Four-current density 
Guiding-center poisson tensor 



Wave four vector 

Radial hyperbolic polar coordinate for parallel 

part of guiding-center four velocity 
Ponderomotive Hamiltonian 

Kernel of ponderomotive Hamiltonian 
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k (|3.485p Radial hyperbolic polar coordinate for parallel 

part of particle four velocity 



L ()5.855j) Guiding-center orbital angular momentum ten- 
sor 

L ()5.919p Wave contribution to guiding-center orbital an- 
gular momentum tensor 

Lgc fl3.582p Guiding-center Lagrangian 

jCg (j2.43p Lie derivative with respect to vector field g 

Cm (|5.773p Lagrangian density of Maxwell field 

Cm (j5.869p Lagrangian density of eikonal wave field 

£ ()4.639p Index for Fourier expansion in gyroangle 

Mo fl5.79()p Magnetization density tensor for perpendicular 

current 

Ml (j5.79ip Magnetization density tensor for parallel current 

M fl3.,^17p Boostgauge field 

m Mass 

dN{r]) pbu Measure of particles with labels between 77 and 

T] + dr] 

M fpm^ Gyrogauge field 

P|| (j3.3U4p Parallel projection operator 

Pj_ f)3.3()5p Perpendicular projection operator 

V (j5.798p Smoothing projection operator 

Qi fj4.(143|) Special Function 

Q fpOHip 

r Particle spacetime position 

Rl (|4.(i44p Special function 

n fpo?^ 

3? The set of real numbers 

S (j5.856p Guiding-center spin angular momentum tensor 
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s ffpn?^ 

mm 

Sm mm 
Sm mm 

t (pmn 

u mm 
mm 

V 



z 



Glossary of Notation 

Wave contribution to guiding-center spin angular 

momentum tensor 
Guiding-center action 

Maxwell action 

Maxwell action due to eikonal wave 
Guiding-center stress-energy tensor 
Wave contribution to guiding-center stress- 
energy tensor 

Member of orthonormal basis tetrad 

Boostgauge-invariant coordinatization of 

guiding-center parallel velocity 
Particle four-velocity 

Three- velocity 

Radial polar coordinate for perpendicular part of 

guiding-center four velocity 

Radial polar coordinate for perpendicular part of 

particle four velocity 
Spacetime coordinates 

Generic coordinates 

Double index contraction: A : B = A^^B^'^ . 



Appendix B 

Vector Spaces, Dual Spaces, 
Algebras, and Modules 

This appendix is included to establish the set-theoretical foundations of tensor 
calculus and exterior algebra, as these ideas are used extensively in this thesis. It 
is intended to provide a review for people already familiar with these topics, and 
to establish notation. The reader is expected to be familiar with linear algebra 
and with the topology of the real numbers. If anything herein is unfamiliar, the 
reader is urged to consult one of the above-mentioned introductory references. 

We begin with some set-theoretical notation: Given two sets, A and B, we 
define the Cartesian product, A x B, to be the set of all ordered pairs, (a, 6), 
such that a E A and h E B. The symbol V is read "for all," and the symbol 3 
is read "there exists." A set is said to be partitioned if there exist subsets such 
that each and every element of the set is a member of one and only one subset. 
A map that associates an element of a set, B, to each element of a set. A, is 
denoted hy A^ B. 

A relation, i?, among the elements of a set. A, is defined to be a subset of 
A X A; we write R d A x A. Two elements of A, say ai and a2, are then said 
to be related if (01,02) G R. In this case, we may write ai ~ 02- A relation is 
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reflexive if a ~ a for all a G A. A relation is symmetric if a ~ 6 implies b ^ a 
for all a, 6 G A. A relation is transitive if a ~ 6 and h c implies a ~ c for 
all a,h,c E A. A relation that is reflexive, symmetric and transitive is called 
an equivalence relation. An equivalence relation naturally partitions a set into 
subsets called equivalence classes. Any two members of the same equivalence 
class are related to each other by the equivalence relation, and members of 
different equivalence classes are not related by the equivalence relation. For 
example, the equivalence relation of "similarity" partitions the set of all triangles 
into an infinity of equivalence classes, and the equivalence relation of "equality 
modulo three" partitions the set of integers into three classes. The relation "is 
the same height or taller than" is not an equivalence relation on the set of all 
trees, because, although it is reflexive and transitive, it is not symmetric, etc. 

The set of all real numbers will be denoted by 3?. The set of all n-tuples of 
real numbers will be denoted by K"^, and the reader is assumed to have some 
familiarity with its usual topology. In particular, by using, say, the Euclidean 
norm, it is possible to deflne open sets as neighborhoods, and thus to have a 
concept of nearness, continuity, convergence, etc. 

Let V be a set with t/, ]/, W, . . . G V, and let a, 6, c, . . . G 3?. Let + denote an 
operation that takes two elements of V and returns a third one; that is, + is a 
map V X V V. Let ■ denote an operation that takes an element of 3? and an 
element of V and returns an element of V; that is, ■ is a map ^ x V ^ V. Then 
V is a vector space over the fleld of real numbers if and only if the following 
conditions hold: 

Condition B.0.8.1 \/U,V,W e V : U + {V + W) = {U + V) + W. 
Condition B.0.8.2 \/U,V e V : U + V = V + U. 
Condition B.0.8.3 30 G V : G V : F + = 
Condition B.0.8.4 \/U E V : 3V E V : U + V = 0. 
Condition B.0.8.5 Va, 6 G 3?, C/ G V : (ab) -U = a-{b-U). 
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Condition B.0.8.6 \fa,b e %U e V : {a + b) ■ U = a ■ U + b ■ U. 
Condition B.0.8.7 "ia e ^,U,V e V : a ■ {U + V) = a ■ U + a ■ V. 
Condition B.0.8.8 WU e V : 1 ■ U = U. 

A set of vectors, C/i, . . . , C/^, is said to be linearly independent if and only if 
the only real numbers, ci, . . . , c„, satisfying 

Ci-Ui+---Cr,-Ur, = (B.925) 

are ci = ■ ■ ■ = Cn = 0. Otherwise, the vectors are said to be linearly dependent. 
The number of elements in the largest possible set of linearly independent vectors 
is called the dimension of the vector space. If a vector space has dimension n, 
then any set of n linearly independent vectors constitutes a basis for that vector 
space. li Vi, . . . ,Vn is a, basis for V, then any vector, U, in V can be expressed 

U = ai-Vi + ---an-Vn, (B.926) 

where the real constants, a, are uniquely determined by U, and can be computed 
by standard techniques of linear algebra. In this case, we say that the basis spans 
the vector space. A vector subspace of a vector space, V, is a subset of V that is 
itself a vector space closed under ■ and +. The dimension of the vector subspace 
is the minimal number of basis vectors needed to span it. 

Vector spaces can be finite or infinite dimensional. An example of an infinite 
dimensional vector space is the space of all infinitely differentiable (C°°) real- 
valued functions on 3?. The addition and multiplication operations are then 

{f + 9){x) = f{x)+g{x) (B.927) 

and 

(a-/)(x)=a-/(x). (B.928) 

This very important space will be called A(3?). A basis for this vector space 
would have to contain an infinite number of elements; the theory of Fourier series 
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provides an example of how to go about constructing and using such bases. The 
set of aU polynomial functions of a real argument is a vector subspace of A(3fJ). 

A functional, U*, operating on a vector space, V, is a map V ^ ^. Equiva- 
lently, we can think of functionals as objects which pair with vectors to yield real 
numbers. The notation for this pairing is {U* , V) G 3?. Note that we frequently 
denote functionals with superscripted stars. It is possible to define operations of 
addition and real number multiplication on the space of functionals as follows: 

{U* + V*,W) = {U*,W) + {V*,W) (B.929) 

and 

{a-U*,W) =a{U*,W). (B.930) 

It is readily verified that these operations make the space of all functionals 
operating on V into a vector space which we shall denote by V*, and which we 
shall call the dual space to the vector space, V. Furthermore, it is also readily 
verified that the dimensions of V and V* are equal. An example of this from 
linear algebra may be instructive: The dual space to the vector space of column 
vectors may be identified with the vector space of row vectors, since a row vector 
and a column vector pair to yield a real number under matrix multiplication. 

If a vector space, V is endowed with a further bilinear operation that maps 
V X V 1-^ V, then it is called an algebra. Since this operation pairs vectors with 
other vectors, it can be written in the form {U, V) G V. By "bilinear," we mean 

{a-U + b-V,W) = a- {U,W) + b- {V,W) (B.931) 

and 

{U,a-V + b-W) = a - {U,V) + b- {U,W). (B.932) 

An algebra is commutative if Vt/, V E V : {U, V) = {V, U). An algebra is asso- 
ciative if yU,V,W eV: {U, {V, W)) = {{U, V),W). The set of real numbers, JJ, 
becomes a commutative, associative algebra when equipped with the operation 
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of multiplication of real numbers. The space A(3fJ) described above is also a 
commutative, associative algebra if we equip it with the multiplication 

ifg){x) = f{x)g{x). (B.933) 

In linear algebra, the set of all n by n square matrices is a vector space of 
dimension with the usual definitions of matrix addition and multiplication 
by real numbers; it becomes an associative (but not commutative) algebra when 
equipped with matrix multiplication. 

An algebra, V, is called a Lie algebra if and only if it is anticommutative 

Vt/, y G V : {U, V) = -{V, U), (B.934) 

and satisfies the Jacobi identity 

WU,V,WeV: {U, (V, W)) + {V, {W, U)) + {W, {U, V)) = 0. (B.935) 

The space of vectors in 3?^ becomes a Lie algebra when equipped with the usual 
cross product. 

A vector subspace of an algebra is called a subalgebra if it is closed under the 
algebra's multiplication rule. For example, the space of all polynomial functions 
of a real argument is a subalgebra of A(3?). A subalgebra of a Lie algebra is 
called a Lie subalgebra. 

We can generalize the concept of a vector field somewhat by relaxing the 
requirement that a and b in Conditions IB. 0.8. 51 through IB. 0.8. 81 above are real 
numbers. Suppose instead that they are members of any associative algebra, A. 
Then Conditions IB . . 831 through IB . . 8?8l still make sense, though the number 1 
that appears in Condition IB. 0.8.81 must be reinterpreted to refer to the identity 
element of the algebra, A. In this case, V is said to be a module over the algebra, 
A. For example, in linear algebra, the space of column vectors is a module over 
the above-described algebra of square matrices. 

Given an algebra, V, with subspace, U, we say that U is an ideal of V if and 
only if {U, V) eU and (F, U) eU for all C/ G W, and F G V. For example, let 
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V be the vector space of all polynomial functions of a real argument, x; Recall 
that this is a subalgebra of A(3?). Then, the subspace, W C V, of all polynomials 
with zeros at some particular location(s) is an ideal of V. 

Throughout this thesis, when a scalar multiplies a vector, the dot is sup- 
pressed; that is, a ■ V is written simply aV. The dot notation is used for other 
things. Also, boldface type is used to denote a vector, though its components in 
a given coordinate system are denoted by the same letter in ordinary typeface 
(with a superscripted index to label components). 
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Gyrofrequency Shift for 
Two-Dimensional 
Nonrelativistic 
Guiding-Center Motion 

As a straightforward but nontrivial example of the vector Lie transform tech- 
nique, we consider two-dimensional nonrelativistic guiding-center motion in a 
magnetic field of the form 

B^B{x,y)z, (C.936) 

and a perpendicular electric field of the form 

E^E^{x,y)5^ + Ey{x,y)y. (C.937) 

To lowest order, the gyrofrequency is given by O = eB/mc. We shall address 
the problem of computing the correction to this quantity due to the spatial 
dependence of B and E. 
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The single-particle equations of motion are 



X = u 



y = V 
g 

m 

V = —Ey-nu. (C.938) 
m 



Introduce the perpendicular velocity and the gyroangle, 



w = a/w^ + f ^ 

e = arg(-t)-m), (C.939) 

so that 

u — —wfiinO 

V = -wcose. (C.940) 

In terms of w and 9 the equations of motion are found to be 

X = —w sin 9 

y = —wcos9 
g 

w — (E^ sin 9 + Ey cos 9) 

m 

9 = -n- (E^cos9 - Eysm9) (C.941) 
e mw 

Here we have introduced the formal ordering parameter e, and have ordered the 
equations of motion by the prescription e ^ e/e and E i— »• eE. 

Though it is most useful and quite elegant to treat this problem with Hamil- 
tonian perturbation theory, we shall instead use Lie transforms directly on the 
dynamical vector field. We do this for the purposes of illustration. In Chapter IHl 
of this thesis, we treat the much more general problem of relativistic guiding- 
center motion in arbitrary electromagnetic field geometry in space-time (includ- 
ing perpendicular electric fields that may be order unity in the guiding-center 
expansion parameter, e), and there we make full use of the Hamiltonian na- 
ture of the equations of motion and we spend a great deal of time studying the 
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associated Poisson structure. It is useful to compare the two approaches. 

We denote the phase-space coordinates by z = (x, y, w, 9), and the equations 
of motion by 

z = lvo+Vi, (C.942) 
e 

where the dynamical vector field is described by 

Vo^ = 

yy = 
V = 

= n (C.943) 



and 



= -w sin 9 

Vi = —w cos 9 



= - — (E^ sin 9 + Ey cos 9) 
m 

Vf = —(E^cos9-EySm9). (C.944) 

The unperturbed problem, z = Vq / e, thus has the solution 

X — Xq 

y ^ yo 

W = Wo 

9 = 9o + nt/e, (C.945) 

so that averages over the unperturbed motion are equivalent to averages over 9. 
At first order, Eq. (|2.222|1 tells us that 

Vi = Vi - CiVo, (C.946) 

where V denotes the Lie transformed dynamical vector field at first order. The 
separate components of the above equation are then 
d 

^g^9i = V^ + wsin9 
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d 

^'^di = Vf+wcos9 
of) 

^^9i = Vr + sin ^ + cos ^) 

oO m 

^^al = Vf + ^{E,cos9-EySm9)+g^n,,+g\n,y. (C.947) 

We demand that the generator vector gi be purely osciUatory (single-valued in 
9). Thus, averaging the above equations immediately yields 

Vi = 0. (C.948) 

Then, we can solve Eqs. f)C.947|l for the components of gi. We get 

g, = --COS9 

y ^ ■ a 

91 = Qsm6' 

Qi = ^ i-E, cos 9 + Ey sin 9) 

gl = -^{E^sm9 + EyCos9)-^^sm9-^^cos9. {CM9) 

Thus we have completely removed the perturbation in the dynamical vector field 
at first order. The guiding-center equations of motion will appear at the next 
order, as will the desired correction to the gyrofrequency. 
At second order, Eq. (|2.223|1 tells us that 

V2 = -C2V0 - CiYi + ^£?Vo = -C2V0 - (C.950) 

The generator g2 must be chosen so that V2 is purely averaged. Thus, without 
having to actually compute g2, we can deduce 

V2 = /-^£iViV (C.951) 



2 

To get the shift in gyrofrequency, we need only V|. Because both Vi and gi 
contain oscillatory terms, the Lie derivative of one with respect to the other will 
contain products of oscillatory terms, and some of these will not average to zero. 
After some tedious algebra, we find 
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This is the gyrofrequency shift. The first term is the shift due to the spatial 
dependence of the perpendicular electric field, and the second term is the shift 
due to the spatial dependence of the magnetic field. The first of these terms was 
discovered by Kaufman j^Tj in 1960, who also showed that it gives rise to the 
phenomenon of gyroviscosity. 

It is interesting to note that, when the results of Chapter |31 are cast into 
"1 + 3" notation and the nonrelativistic limit is taken, the first of the above pair 
of terms is present but the second is not. This is because the ordering scheme 
used is quite different. In this appendix, we treated the perpendicular electric 
field as an order e quantity, whereas in Chapter El we took it to be order unity. 
Thus both terms appear at the same order above (the first term has a spatial 
gradient and an electric field, and the second term has two spatial gradients), 
whereas in Chapter El the second term would appear at one higher order than 
the first term (and we did not calculate to high enough order there to see it). It 
is also interesting to note that the term involving TZ in Eq. fj3.51ip of Chapter El 
is a three (or higher) dimensional effect, and has no analog in two-dimensional 
guiding-center motion. 
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Appendix D 

Properties of the Special 
Functions 



The following is a list of properties of the and Ri functions that follow directly 
from their definitions given in Section 14.41 

D.0.9 The Q Functions 
Property D.0.9. 1 (Defining Integral) 



271" Jo V ixsin^ 
Property D.0.9. 2 (Relationship with Bessel Functions) 



^ [xQi{x)] = Ji{x) 



Property D.0.9. 3 (Power Series) 

{2j + £ + l)j\{i + j)l {£+1) 



j=o 

Property D.0.9. 4 (Asymptotic Behavior for Large Argument) 
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Properties of the Special Functions 



Property D.0.9.5 (Recursion Relations) 

Qe-i{x) + Qi+i{x) = — dy- 



X Jo y 
2 

Qi-i{x) - (5^+i(x) = -Jt.{x) 

X 

Property D.0.9.6 (Formula for Derivative) 



Q[{x) = -[J,{x)-Q,{x)] 
Graphs of the Q functions are presented in Fig. ID.IL 



D.0.10 The R Functions 
Property D. 0.10.1 (Defining Integral) 

Jo \ x^ sin ^ / 

Property D.0.10. 2 (Relationship with Bessel Functions) 



— = 2xJi{x) 



Property D. 0.10.3 (Power Series) 

{2j + e + 2)j\{£ + jy. {i + 2)i\ 



r,. (-l)^(x/2)W _ 2{x/2y 



Property D. 0.10.4 (Asymptotic Behavior for Large Argument) 



Property D.0.10. 5 (Recursion Relations) 

Re-i{x) + Re+i{x) = —Qi{x) 

X 

4 

Ri-i{x) - R£+i{x) = - [Je{x) - Qi{x) 

X 
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Figure D.l: The Q Functions 
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Property D. 0.10.6 (Formula for Derivative) 



Graphs of the R functions are presented in Fig. ID .21 
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Figure D.2: The R Functions 
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Appendix E 

Useful Bessel Function 
Sums 



All of the Bessel function summation formulas used in Chapter 0] can be derived 
from the following theorems: 

J2 Ji+k (z) Ji-k (z) = 5ko (E.953) 

e 

and 

J2 Ji+k+i{z) Ji-k (z) = 0, (E.954) 

i 

the usual Bessel function recursion relations 

Of 

J,_i(z) + Je+i{z) = —Je{z) (E.955) 

and 

J,_i(z) - J,+i(z) = 2j;(z), (E.956) 

and the parity rule 

J.iiz) = i-lYMz). (E.957) 
To verify Eq. fjE.95311 . let us define 

fk{z) = J2 Ji+kiz) Ji-k (z) , (E.958) 

i 
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Useful Bessel Function Sums 



and differentiate with respect to z to get 





[{Ji+k-i — Je+k+i) Ji-k + Je+k {Ji-k-i — Ji-k+i)] 



{j'i+kJi-k + Ji+kJt-k) 




^ {Ji+k-lJi-k — Ji+k+lJi-k + Ji+k+lJi-k ~ Ji+k-lJi-k) 



0, (E.959) 



where we have used Eq. flE.956|l in the second hne and have redefined the sum- 
mation variable in the third Hne (we have also omitted explicit indication of the 
functional dependence of Ji on z since no ambiguity can result from doing so). 
This means that fk{z) cannot depend on z, so it is a constant for each value of 
k. To find the value of this constant, set z equal to zero in Eq. fjE.95311 . Recalling 
that Je{0) = dio, we see that fk{z) = dko, and the theorem is proved. 
To verify Eq. flK.954jl . use the parity rule, Eq. (jK.957j) . We have 



where we have redefined the summation variable in the first line {i ^ —i in the 
second term), and used the parity rule in the second line. 

These theorems can be used to derive sum rules with summands that are 
quadratic in the Bessel functions. To do this, note first that setting k = in 
Eqs. fjE.953|l and f)E.954jl immediately yields 




= 2^ (j^+k+iJ^-^ + i-ir^'Ji+k+iJi-k) 



e 




e 



(E.960) 




(E.961) 
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and 




(E.962) 



To derive a sum rule that includes £ raised to some power, first use Eq. f)E.955|l to 
get rid of the power of i. To derive a sum rule that includes a derivative of a Bessel 
function, first use Eq. ()E.956|1 to express the Bessel function derivative in terms 
of undifferentiated Bessel functions; alternatively, if a sum rule that includes 
a Bessel function derivative can be expressed as the derivative of another sum 
rule with undifferentiated Bessel functions, then this is usually a better way to 
proceed. 

As an example of some generality, consider the sum over i of i^JiJ'^. This 
can be expressed as follows: 




(E.963) 



Now note 




— j— {Ji-l + 2J^-l J^+l + Jl+l) 



+2[(£-l)(£ + l) + l] J,_iJ,+i 



} 





so that application of our theorems to this last equation yields 
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+2 



+ 



— (0 + + 1 + 0) + 



;i + 2 -0 + 1)- 2(0 + 0) + 1 



Thus, we finally get 



(E.965) 



(E.966) 



2 4 

The following is a list of useful results that can be established in the above 
manner: 



e 

E 

e 

£ 

E 

e 



JiJg^ = 
iJiJe+ = - 

rj — * n-+ 



P 



V2Xb 



Fn-k 



Xb 

{Jl-1 — Jl+l) Ji 



V2 



Fn-k. 



(E.967) 
(E.968) 

(E.969) 
(E.970) 

(E.971) 



These sum rules are needed in the proof that the results for K2 in Eqs. fj4.717p . 
fl4 .71911 and ()4.735j] are indeed the same. 



